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A  LINEAR  PROGRAMMING  MODEL  FOR  DESIGN  OF 
COMMUNIC  ATIONS  NETWORKS  WITH 
TIME  V  ARYING  PROBABILISTIC  DEMANDS 
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uncsimoni  cosis  .ire  assumed  known  tor  iw<>  kinds 
ot  cHiiipnieni  spicks  cmplovevl  to  supply  iclccommumcaiions  services  uunks 
and  switching  Ltulilics.  \  network  hierarchy  is  defined  which  includes  impor¬ 
tant  cases  occurring  in  the  field  and  also  appearing  in  the  literature  A  dilVerent 
use  o|  i he  classical  concept  of  the  marginal  capacity  o!  an  addilional  trunk  at 
prescribed  blocking  probability  leads  to  a  linear  programming  supply,  model 
which  can  he  used  to  compute  the  m/cs  of  all  the  high  usage  trunk  groups 
I  lie  si/es  ot  tile  remaining  trunk  groups  are  approximated  by  the  linear  pro¬ 
gramming  models,  but  can  be  determined  more  accurately  by  alternate 
I  methods  once  all  high  usage  group  m/cx  are  computed  I  he  approach  applies  to 

larger  scale  networks  than  previously  reported  in  the  literature  and  permits 
duec  application  ol  the  duality  theory  ol  linear  programming  and  its  sensitivity 
atiilw-s  to  the  study  and  design  of 'switched  probabilistic  communications  net¬ 
works  with  multiple  busy  hours  during  the  day  Numerical  results  are  presented 
lor  two  examples  based  on  held  data,  one  ol  which  having  been  designed  by 
tin*  multi-hour  engineering  method  . 

f 

I.  INTROIH  CTION:  A  DKSK.N  NTHKSIS  PROBLEM 

In  ihiN  paper  we  treat  telecommunications  networks  where  customer  demands  tor  service 
are  specified  probabilistically  between  pairs  of  junctions  according  to  dilVerent  hours  of  the  day. 
Telephone  traffic  may  How  over  a  direct  route  which  joins  two  distinct  junctions  or  over  an 
alternate  route  which  is  defined  in  prespeeilied  network  routing  hierarchy.  Networks  which  per¬ 
mit  alternate  routing  of  trallie  are  termed  sninlwil  because  switching  operations  are  required  to 
alternately  route  a  call.  I  he  network  routing  hierarchy  permits  traffic  which  is  blocked  on  a 
direct  route  to  he  switched  through  other  junctions.  The  switching  process  tends  to  smooth  out 
the  peaks  of  trallie  loads  which  occur  throughout  the  network  at  dilVerent  times  of  the  day. 
Consequently,  less  equipment  may  he  required  to  service  the  overall  trallie  load  on  the  network 
than  lor  a  similar  network  without  alternate  routings. 

An  example  of  a  network  routing  hierarchy  is  given  below  in  l-'igurc  I.  It  consists  of  junc¬ 
tions  I  through  //  and  two  dilVerent  kinds  ot'  links  joining  certain  pairs  of  junctions.  A  link  is 
merely  a  dimensionless  entity  whose  existence  indicates  that  telephone  calls,  collectively  termed 
truth i.  may  flow  in  either  direction  between  the  two  junctions  which  it  joins,  without  involving 
any  other  junction  than  these  two.  A  dashed  line  designates  a  iliivci  link  while  a  solid  line 
designates  a  final  link.  If  there  is  a  direct  link  between  a  call-origination  junction  and  a 
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call-destination  junction.  then  a  telephone  conneelion  is  lirst  attempted  on  this  link,  the  first 
ehoiee  route  Should  the  lirst  ehoiee  eonneetion  fail,  then  an  attempt  is  made  to  alternately 
route  the  eall  b>  was  of  final  links,  and  in  this  ease  the  traffic  is  referred  to  as  tra/fu. 

Arrows  in  figure  I  indicate  the  overflow  routing  scheme.  In  ease  no  direct  link  exists  between 
a  eall  origin  and  destination,  then  the  eall  is  also  routed  along  the  final  links.  Should  a  connec¬ 
tion  on  final  routes  fail,  we  say  that  the  eall  is  "lost."  and  the  caller  must  try  to  place  the  eall 
again 


I  k.i  ki  l.  A  neiwurk  hicrachv  with  direci  (  —  —  —  —  )  and  final  ( - ) 

links  where  overflow  fioni  a  direel  link  onio  a  final  link  is  imlie.ued  by  an  arrow 


The  basic  problem  attacked  in  this  paper  is  one  of  design  synthesis:  solve  for  least-cost 
equipment  changes  in  a  given  network  routing  hierarchy  which  are  sufficient  to  meet  altered 
point-to-point  customer  demands  for  service  during  different  times  of  day  to  within  a  prescribed 
blocking  probability.  The  emphasis  is  on  the  provision  of  a  telecommunications  service  by  an 
optimal  use  of  available  equipment.  The  model  we  develop  includes  a  probabilistic  specification 
of  customer  demand  by  time  of  day  and  includes  alternate  routings,  where  each  direct  link  has 
a  uniquely  specified  alternate  route  in  the  hierarchy.  It  is  a  nonlinear  integer  program  P,  which 
takes  as  a  basic  "unit"  of  equipment  the  concept  of  a  "trunk."  The  terminology  Requires 
elucidation. 

In  this  paper  a  trunk  shall  merely  refer  to  a  channel  which  is  required  in  order  for  a  tele¬ 
phone  call  to  transpire  between  two  junctions  in  either  direction.  As  such,  it  is  a  dimensionless 
quantity  The  call  carrying  capacity  of  a  trunk  depends  on  the  probabilistic  mechanism  underly  ¬ 
ing  customer  calling  patterns,  for  example,  during  a  fixed  hour  a  trunk  could  carry  60  one- 
nnnuie  serially  placed  telephone  calls.  Under  this  discipline  the  total  carried  load  during  the 
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hour  is  .'Mill  call-seconds,  denoted  .'(>  CCS.  I  xpressed  another  wav.  observe  lh.il  ihc  proba¬ 
bility  nl  a  call  being  blocked  is  zero.  On  die  oilier  hand  should  a  demand  lor  Ml  one-minute 
calls  occur  siniullaneousK .  then  die  offered  load  is  siill  ah  C(  S.  bin  only  Oh  (  CS  is  actually 
carried  I  he  blocking  probability  is  now  59/M) 

A  collection  of  trunks  loining  two  distinct  junctions  is  merely  referred  to  .is  a  nun  A  ^nni/i 
It  is  convenient  to  v  iew  a  link  as  a  trunk  group  According  to  network  engineering  principles,  it 
has  been  customary  to  assume  that  customer  originated  calls  are  generated  by  Poisson  process 
and  are  assigned  seiiuentialh  to  a  trunk  group  I  liese  assumptions  yield  an  important  property 
which  is  fundamental  to  our  development  ol  a  good  linear  programming  approximation  to  the 
nonlinear  integer  program  /’.  namely,  that  the  carried  load  on  the  Iasi  trunk  is  monoionieully 
decreasing  with  the  number  of  trunks,  see  Messerli  |I5!  I  he  necessary  results  upon  which  the 
linear  programming  construction  is  based  are  proved  in  the  Appendix 

I  he  nonlinear  and  linear  supply  models  of  this  paper  employ  certain  concepts  ol  unit  costs 
with  respect  to  both  trunking  and  switching  The  definition  ol  "cost"  shall  be  limited  to  the 
incremental  investment  cost  of  providing  a  trunk  on  die  direct  route  between  two  junctions  and 
the  incremental  investment  cost  of  providing  a  trunk  along  the  uniquely  specified  alternate 
route  connecting  these  two  junctions.  In  addition,  we  shall  include  unit  switching  investment 
costs  per  CC  S  as  a  crude  approximation  for  switching  investments  stemming  from  switching 
calls  from  one  trunk  group  to  another 

finally,  we  present  linear  programming  solutions  for  two  network  hierarchies  occurring  in 
the  field,  one  of  which  has  been  designed  using  nonlinear  steepest  descent  methods,  see  Eisen- 
berg  [5 1  and  Eisner  |6|  This  section  also  contains  a  user's  guide  for  implementation  of  the 
model 

2.  APPROACHES  TO  DETERMINE  TRl  NKINf.  AM)  SVMK  HIM.  REQl  1  REM  ENTS 
TO  MEET  DEMAND  EOR  SERVICE 

Over  the  past  ad  years  it  appears  that  there  have  been  at  least  two  basic  approaches  to  the 
design  synthesis  problem  discussed  in  the  previous  section. 

I  he  basic  thrust  of  our  paper  proceeds  according  to  what  we  term  the  ln\i  appnuuh  to  tile 
design  problem  It  incorporates  specific  probability  distributions  for  each  parcel  ol  trallic.  where 
a  parcel  is  merely  that  portion  of  trallic  which  follows  specific  routes  in  the  network.  Dillerent 
parcels  experience  dillerent  blocking  probabilities,  even  on  the  very  same  trunk  group  for 
example,  a  given  trunk  group  may  accommodate  customer  originated  traffic  governed  by  the 
Poisson  probability  distribution,  and  the  group  nitty  also  accomodate  overflow  trallic  which  is 
" peaked.’  in  the  sense  that  the  mean  of  the  distribution  is  less  than  its  variance.  Investigations 
of  the  blocking  probabilities  of  individual  parcels  have  been  made  by  Wilkinson  |20|.  kai/  1 1  2 1 . 
and  more  recently  by  Deschamps  |4|. 

The  pioneering  work  representing  a  probabilistic  approach  which  has  had  widespread  use 
throughout  the  telecommunications  industry  is  the  1954  paper  by  Truitt  |I9|.  The  generally 
accepted  name  of  the  method  reflects  the  fact  that  economic  considerations  are  also  part  of  the 
analysis  The  method  is  termed  the  "EC'CS-method,"  where  the  letter  stands  for 

"economic."  The  method  was  introduced  by  Truitt  for  the  simplest  of  routing  hierarchies  con¬ 
sisting  of  a  triad  of  junctions  with  one  overflow  possibility,  and  one  specific  time  of  day  (single 
hour)  The  solved-tor  variables  are  the  specific  sizes  of  all  trunk  groups. 


further  important  extensions  of  the  I  CCS-method  occurred  in  three  directions  lust, 
more  accurate  refinements  of  the  overflow  distributions  themselves  were  made  following  the 
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"equivalent  random  method"  of  Wilkinson  |2(l|  Second,  more  complicated  network  hierarchies 
were  introduced,  see  for  example  Rapp  |l>|  I  he  third  advance  involved  incorporating!  traffic 
overflows  and  constraints  on  blocking  probabilities  lor  more  than  one  lime  ol  da.  in  the  same 
cost  minimi/ation  model,  see  Rapp  |lx|  and  I  asenberg  )sj  It  appears  that  it  is  necessary  to 
consider  overflow  irallie  for  multiple  times  of  day  in  order  to  determine  trunk  gmup  si/es 
which  meet  staled  blocking  probability  constraints  In  addition,  networks  based  on  held  data 
have  been  reported  in  L  i  sen  berg  |s|  and  1  Isner  |b|  where  potential  cost  savings  mav  he  realized 
hv  incorporating  multiple  times  of  dav 

file  scioml  maun  approtnh  to  iletermme  levels  ol  telecommunications  equipment  appeared 
in  the  I *-)>(>  paper  ol  kalaha  and  Juneosa  lilt  Th"''  approach  is  based  on  a  linear  program¬ 
ming  model  for  a  classical  routing  problem  having  van.,  ile  link  capacities,  and  as  such  is  a  laige 
scale  one  Several  contrasts  to  the  first  approach  (embodied  in  the  I  CCS  method)  are 
apparent. 

first,  the  parcels  of  traffic  in  the  Kalaba-Juncosa  model  are  deterministic.  Irallie  originat¬ 
ing  at  junction  t  and  terminating  at  junction  /  is  a  given  constant,  a ,  Second,  demands  are 
specified  for  each  year  (or  other  relevant  time  period),  in  contrast  to  a  specification  for  multiple 
"hours"  within  a  fixed  time  period.  Consequently,  link  capacities  may  be  specified  tor  ensuing 
future  periods,  hut  the  impact  of  multiple  busy  periods  within  a  given  period  has  not  been 
modeled. 

In  spite  of  severe  deterministic  assumptions,  the  pioneering  linear  programming  model  of 
Juneosa  and  Kalaha  can  theoretically  accommodate  all  conceivable  routing  possibilities,  for  their 
trallic  variables  arc  indexed  by  an  origin-destination  point  pair  and  also  an  intermediate  switch¬ 
ing  point,  over  all  possible  triads. 

About  5  years  after  the  Juncosa-Kalaba  paper,  a  series  of  papers  written  by  Ciomory  and 
Hu  on  communication  network  flows  appeared  in  the  SIAM  Journal  [8 ] ,  [9] .  [I()|.  Their  work 
occurred  over  a  4-year  period  and  expanded  significantly  the  size  of  the  linear  programming 
network  models  that  could  be  treated  computationally.  They  were  able  to  combine  features  of 
generalized  linear  programming  decomposition  techniques  with  efficient  Tord-Tulkerson 
methods  tor  solving  network  subproblems.  Ciomory  and  llu  also  stressed  the  importance  of 
including  communications  demands  indexed  by  time,  such  as  time  of  day.  r.  They  proceeded 
under  the  reasonable  assumption  that  the  time  variable  assumes  only  a  finite  number  of  values. 
Alternatively,  one  could  employ  a  continuous  load  curve  with  time-of-day  varying  demand 

Ciomory  and  llu  illustrated  their  computational  approach  on  a  10-node.  20-arc  network 
with  demands  for  two  dillerent  time  periods,  and  a  given  set  of  unit  capacity  (expansion)  costs. 

Rased  on  discussions  with  engineers  in  the  field,  principally  from  the  Long  Lines  Com¬ 
pany  ol  A  I  <tt  I.  we  have  found  that  both  approaches  have  had  significant  impact  in  the  actual 
design  of  telecommunications  networks.  1  he  completely  deterministic  approach  (the  second 
approach)  has  been  particularly  important  in  delineating  first  choice,  second  choice,  etc  alter¬ 
nate  routes  between  pairs  of  lunctions  to  be  used  in  defining  a  network  hierarchy  .  Once  a  net¬ 
work  hierarchy  is  established,  economies  of  scale  are  then  achievable  according  to  optimal  use 
of  the  underlying  probability  distributions  of  originating  and  alternately  routed  customer  traffic 

\  convenient  characterization  of  a  network  hierarchy,  very  useful  to  our  approach,  is 
introduced  in  the  next  section 
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3.  CTIARACTF.RIZATION  OK  A  NKTWORK  IIIKRARC  HY 
3.1.  The  Hierarchy  Matrix 

Given  a  network  hierarchs  such  as  figure  I.  lei  us  list  the  junctions,  termed  /mini s.  as 

/>!•  P: . p„  where  </  is  a  positive  integer  Bv  a  i  a  Unix  pun  we  shall  mean  an  ordered  pair  of 

distinct  points  (</./>).  "a"  being  referred  to  as  oripm  and  "A"  as  tlcsniuiiioii 

In  Section  I  we  defined  what  is  meant  by  direct  anil  final  links.  Am  two  distinct  points 
mas  or  mas  not  he  joined  by  a  link,  hut  no  tsvo  points  can  be  joined  by  both  a  direct  and  a  final 
link  l  ach  link  mas  carry  traffic  for  each  of  its  tsvo  calling  pairs,  since  traffic  may  flow  in  either 
direction  between  the  two  points  it  joins,  l  or  any  calling  pair  (i/.A>  we  assume  that  a  call  can 
be  routed  v  ia  a  unique  sequence  of  final  links,  w  hich  we  shall  term  the  Imal  nuiniix  <>l  the  ivllnix 
pan  I  a.h ) 

Let  us  list  the  set  of  final  links  bs  the  positive  integers.  ./  =  1,  2 . K  We  list  the  set 

of  calling  pairs  also  bs  positise  integers,  i  =  1.  2 . V„.  where  . V, ,  =;  i/ti/  -  I  >  is  the  total 

number  of  calling  pairs  in  the  network 

for  purposes  of  algebraic  representation  we  display  final  routing  its  a  matrix  which  has  a 
row  for  each  calling  pair  /  and  a  column  corresponding  to  each  final  link.  ,/.  We  term  this 
matrix  the  liuranln  matrix.  denoted  Itt./I.  and  specify  the  entry  in  the  /-th  row  and  ./- 1 h  column 
to  be  a  nonnegative  integer  defined  as  follows: 

the  integer-valued  position  of  the  ./-th  final  link  in  the  final  routing  of 
U  )  tt,i  =  calling  pair  i.  if  final  ./  belongs  to  this  sequence 

d.  otherwise. 

Observe  that  the  row  indices  of  the  non/ero  entries  in  the  ./-th  column  represent  all  the  calling 
pairs  which  utilize  final  ./  in  the  final  routing  of  calls.  We  denote  the  set  of  these  non/ero 
indices  II  ,. 

A  certain  subset  of  the  calling  pairs  may  also  be  served  by  direct  links,  such  as  the  ones 
drawn  as  dashed  lines  in  I  igure  I  These  calling  pairs  are  known  as  high  trvv.cr  calling  pairs,  and 
the  direct  links  as  high  usage  links  The  case  where  there  are  no  high  usage  links  may  be 
treated  without  loss  of  generality  as  one  with  high  usage  links  hav  ing  0  number  of  trunks,  latch 
high  usage  link  provides  a  direct,  first  choice  route  exclusively  for  trallic  between  its  endpoints, 
in  both  directions,  while  the  remaining  nonhigh  usage  calling  pairs  rely  solely  on  final  routing. 
Overflow  traffic  from  a  high  usage  calling  pair  shall  merely  follow  the  uniquely  specified  final 
routing  for  the  pair 

Lach  high  usage  link  is  associated  with  two  high  usage  calling  pairs,  each  with  the  same 
points,  but  oppositely  ordered  Thus,  if  there  tire  3/  number  of  high  usage  links,  there  are  23/ 
number  of  high  usage  calling  pairs,  and  23/  is  tin  even  integer  Observe  also  that  AH.  as  the 
product  of  tm  odd  and  an  even  integer,  is  itself  an  even  integer,  and  so  for  some  integer 
V.  \„-  2 A 

This  discussion  suggests  relabelling  the  calling  pairs  using  the  integers  A . 

2.  1.1,2, . V  I  or  instance,  I  and  -  I  represent  pairs  of  the  same  two  points,  but  with 

opposite  ordering,  meaning  the  opposite  direction  for  trallic.  Let  us  further  specify  that  the 
integers  V/ .  1.1 .  \l  are  reserved  for  high  usage  pairs.  Since  existence  of  final 
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links  is  assumed  .mil  no  selling  pan  is  joined  by  hoih  a  direct  and  a  final  link,  n  follows  that 
\l  \  Moreover.  /  is  a  ilonhigh  usage  calling  pail  il 'and  only  il'l/l  >  A / 

Consider  I  igurc  I  tor  the  purposes  ol  illustration  I  here  are  X  nodes,  so  there  are  XIX 
!•  -  s<,  calling  pans  I  Inis  \  '•<>/ 2  -  2X.  I  here  are  7  final  links,  and  X  high  usage  links. 

lienee  1(>  high  usage  pairs,  labelled  X.  7 .  I.  I.  2 . X.  The  remaining  calling 

pairs,  labelled  2X .  ').  d . 2X  are  serviced  only  by  final  routing.  A  portion  of  the 

hierarchy  matrix  is  given  in  I  able  1  I  he  lull  mairix  has  3h  rows,  and  7  columns. 


I  AM  I  I  I  1‘iirtioii  of  the  llmvnln  Matrix  of  ii^itre  / 


final 

. Calling  Pair 

and  Its  Integer 

Index 

link 

ind  Its  Integer  Index 

AH 

1 

AC 

Al) 

M 

Cl 

CCi 

(II 

2 

_ 

3 

4 

5 

6 

7 

((/.«> 

3 

2 

0 

0 

■ 

0 

1 

0 

!/.//)  -2 

3 

2 

0 

0 

1 

0 

0 

</.  1)  1 

2 

«) 

0 

1 

0 

0 

0 

I  1./.  1  1 

1 

c 

0 

7 

0 

0 

0 

(«./)  2 

1 

) 

0 

» 

3 

() 

0 

(//.(;)  3 

1 

2 

0 

0 

<> 

3 

0 

II./)  9 

0 

’ 

J .... 

0 

0 

0 

0 

Ify  labelling  the  high  usage  links  bv  the  integers  I . \l.  we  can  have  link  /correspond 

to  the  high  usage  pairs  /and  •  /  We  then  relabel  the  final  links  as  \l  +  I .  \/  +  A.  and 

relabel  the  rows  and  columns  ol  the  hierarehv  matrix  in  the  same  manner  as  we  did  the  calling 
pairs  and  final  links,  respectively  I  bus.  \  refers  to  the  lirsl  row  in  the  mairix,  and  <  A/  +  I  ) 
lire  lirsl  column,  but  this  departure  Irom  orthodox  notation  is  compensated  for  bv  added  con¬ 
venience.  In  practice  il  is  onlv  a  mailer  of  defining  two  label  vectors. 

To  summarize  the  listings,  when  we  write  "link  (or  trunk  group)  L"  "high  usage  link  /", 

and  "final  link  J"  it  shall  be  understood  that  A  €  1 1 . M  +  Ac  1 .  /  €  (1 . M),  and  J  € 

I M  +  1 . V/  +  A  )  respectively.  Similarly  for  "calling  pair  /”  and  "high  usage  calling  pair 

i j  6  f-.V . -1,  1 . V|  and  t  (  {- M . -1,1 . M\  respectively. 

A. 2.  Classify ing  Poim-to-Point  Offered  loads 

for  each  calling  pair  /  there  is  a  nonnegalive  demand  for  traffic  denoted  a  termed  on- 
xmaim.u  ualtn  I  rallic  is  usually  stated  in  units  of  erlangs,  or  in  hundred  call-seconds  per  hour 
lOOSl  as  discussed  in  Section  I 

Let  ./  be  a  fixed  final  link  Traffic  parcels  offered  to  ./consist  of  two  types:  overflow  traffic 
from  high  usage  calling  pairs,  and  final-routed  traffic  from  nonhigh  usage  calling  pairs  The 
lypes  are  distinguished  because  of  I  heir  different  probability  distributions,  as  seen  in  the  next 
section 
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Because  ol  t  his  distinction,  it  is  useful  to  separate  11/  into  two  subsets; 
1 1 V  —  1/  t  II/;  l./l  <  A/),  i.e,,  the  high  usage  calling  pairs  overflowing  onto  final  link  J,  and 
llj  =  [/  €  11/  :  |/|  >  A/),  i.e.,  those  nonhigh  usage  calling  pairs  utilizing  ./  in  their  final  rout¬ 
ing.  Clearly, 

11/  =  II'/'  U  II J 
and 

II y  n  II/1  =  V. 


Two  simple  examples  from  figure  2  below  illustrate  this  classification,  where  in  both 
networks  (a)  and  (b)  final  links  are  designated  by  2  and  A.  Network  (b)  has  no  high  usage  cal¬ 
ling  pairs,  while  in  (a)  the  single  high  usage  link  is  denoted  by  1. 


Numbering  Scheme  for  pairs  in  (a)  and  (b): 


C  'a  I  liny  {Kill 

(/)!./);) 

<  /’!./»  I  > 
'/'.'■/'l> 

<  />!•/>.'> 
</),./),) 


Index  Sumbi'r 

-3 

-2 

- 1 
1 

3 


lie  ,i  HI  2  I  wo  triads 

In  <a),  II"  =  II"  =  1-1.1),  while  111  =  {-2.  2i ,  and  111  =  1-3.3).  In  (b).  II V  =  11','=  0,  while 
111  =  1-2.  -  I.  I.  2)  and  111  =  |-3,  -I,  1,  3). 

4.  THK  FORMULATION  OF  A  NONLINEAR  SUPPLY  MODEL 

4.1.  Blocking  Probabilities  and  Oxertlow  Traffic 

The  call  discipline  is  one  of  the  factors  in  determining  the  relationship  between  the 
offered  load  to  a  trunk  group  and  its  carried  load.  Another  key  factor  in  determining  carried 
loads  is  the  assumption  that  customer  originated  traffic  is  Poisson  distributed  with  arrival  rate 
denoted  by  K.  sec  Messerli  [131.  fortunately,  there  is  strong  evidence  to  suggest  that  the 
number  of  calls  occurring  in  a  fixed,  small  time  interval  can  be  adequately  modeled  as  a  Pois¬ 
son  probability  distribution  With  these  assumptions  the  distinction  between  a  trunk  group's 
offered  load  and  carried  load  can  now  be  made  precise. 
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Assume  !h;il  culls  are  assigned  sequentially  to  a  trunk  group  consisting  of  n  trunks.  Let  A 
denote  the  average  customer  arrival  rate  according  to  the  Poisson  distribution.  The  only 
assumption  required  on  customer  calling  time  is  that  it  has  finite  mean /a.  Otherwise,  it  may  be 
arbitrarily  distributed.  Under  these  conditions  the  probability  that  all  of  the  //  trunks  in  the 
group  are  buss  is  given  by  the  classical  Erlang  /M'ormula 


(21  Hin. a)  =  UiVn'M  £  ia'ik  !>. 

for  n  =  0.  I.  . .  .  M'/ti  iv  a  -  A /a  a iili  its  ii/nrs  termed  erhiuK s  The  history  of  the  original 


Erlang  formula  and  its  important  generalizations  may  be  found  in  (inedenko-kovalcnko  1 7 1  anil 
Sv  ski  1 1  81 

\n  erlang  is  thus  a  measure  of  the  How  of  traffic  per  unit  time.  In  the  traffic  engineering 
literature  an  erlang  is  one  call-hour  per  hour,  or  equivalently  36  CCS  per  hour.  The  "hour"  as 
the  unit  of  time  is  so  standard,  it  is  usually  dropped,  and  one  says  an  erlang  is  36  CCS.  The 
value  ”</"  in  the  L.rlang  formula  is  termed  the  mean  of  the  offered  load  to  the  given  trunk 
group.  The  expected  overflow  traffic  is  then  a/Uii.a).  With  traffic  flowing  in  both  directions, 
similar  formulas  apply 

Suppose  for  some  integer  /.  3/  ^  ^  M.  a  traffic  intensity  a,  from  high  usage  calling 
pair  i  is  offered  to  high  usage  link  /  consisting  of  v,  number  of  trunks,  where  /  =  |/|. 
(Through  (he  paper  we  shall  always  take  /  =  |/|  in  that  context.)  According  to  (2)  above,  the 
probability  that  all  .v,  trunks  are  busy  is  Hix/.a,  +  a  recalling  traffic  intensity  a  ,  running  in 
the  reverse  direction  shares  the  trunks  on  /.  Hence.  a, Hix/.a,  +  a  ,)  is  the  expected  amount 
of  traffic  overflowing  to  the  first  link,  J.  in  the  final  route  sequence  of  /.  Final  link  /.  however, 
carries  other  parcels  of  traffic  as  well,  as  seen  in  Section  3.2:  overflows  from  the  other  high 
usage  calling  pairs  represented  in  II",  and  traffic  from  the  nonhigh  usage  pairs  represented  in 
II  j.  We  next  formalize  the  idea  of  the  quality  of  service  of  the  network  and  introduce  a  useful 
assumption  on  the  marginal  capacity  of  a  trunk  group. 

4.2.  Network  Quality  of  Service  and  an  Assumption  on  Marginal  Capacities 

The  important  benefits  of  being  able  to  compute  changes  in  equipment  slock  to  meet 
changes  in  demand  were  recognized  much  earlier  by  Kalaba  and  Juneosa  (III.  Gomory  and  Hu 
(8|.  [9j.  |l()|  and  others  fortunately  ,  incremental  studies  on  the  network  hierarchy  introduced 
in  Section  3  permit  certain  simplifying  assumptions  that  make  computations  attractive.  These 
assumptions  relate  to  the  concept  of  the  marginal  capacity  of  an  additional  trunk  at  a  prescribed 
blocking  probability  The  resulting  supply  model  is  an  optimization  which  is  much  simpler  than 
would  be  possible  when  constructing  a  network  ah  initio.  The  assumptions  and  model  are  now 
presented. 

DEFINITION:  Assume  Q\ ,  the  traffic  offered  to  a  given  link  L  at  time  t,  has  a  fixed  pro¬ 
bability  distribution,  and  let  mlQlf  denote  its  mean.  Given  that  L  consists  of  x,  number  of 
trunks,  let  p. ,  f.V/  .Qj  )  denote  the  mean  of  the  overflow  distribution.  The  carried  load  is  that 
portion  of  the  offered  load  which  does  not  overflow.  For  the  case  that  L  is  a  final  link  J.  the 
quality  of  service  p  j  of  final  link  J  is  defined  by  f>  j  =  l  -  Pj,  where 

M/'xv.y;) 
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Since  the  overflow  is  less  than  the  offered  loud,  p,  lies  between  0  and  1  The  mean  of  the 
carried  load  is  in  (Qj)  -  /a /( X/.yj).  According  to  the  network  hierarchy,  overflow  from  any 
linn  I  link  is  lost.  Let  us  illustrate  how  these  definitions  are  employed  in  calculating  carried 
loads  on  serially  connected  final  links,  in  a  simple  example  consisting  of  final  links  I.  2.  and  3 
as  shown 

O  ~  ~ 

Assume  that  the  only  offered  loads  on  2  and  3  stem  from  carried  loads  on  1  and  2,  respectively 
Assume  that  there  is  only  one  time  of  day  /„  and  one  quality  of  service  />.  Thus. 

P  i 1  V/ ■  Qj  >  _  , 

in  ( Qi )  1 

and  m  ( (2/*  | )  =  m  ( Qj )  -  /a  / (.V/ .Qj)  for  J  =  1,2.  I lence,  m  ( Qj  + 1 )  =  m  ( Qj )p .  J  =  1,2  and  so 
m  ( Q\)  =  m  <  Q\  )p\ 


A  formal  extension  of  this  argument  shows  that  for  any  final  link  ./  the  mean  of  the 
overflow  from  any  high  usage  calling  pair  /  €  11/  is  at  least 

13)  a'  fp"'1  " 

providing  II)'  is  nonempty  and  where  />  =  maxIp/jT  =  M  +  1 . \l  +  K ). 

Marginal  C  opacity  Assumption 


l  et  p  be  fixed,  for  each  ./  we  assume  that  there  exist  two  positive  constants  y  ,  and  h, 
such  that  if  r+  and  r“  are  two  offered  loads  having  probability  distributions,  and  hi(t+)  >  0. 
then 


P-j 


(4a) 


,  „  m(r  +  )  ,  ...  . 

A,  +  - /  .Q',  +  T  + 

y  j 


111  (  Qj  +  T+) 


<//(=!-/)>, 


and  if  0  <  mfr  )  <  bj  then 


(4b) 


max 


Pi 

-V/  - 

ml--) 

.Qi  ~  r- 

y  / 

"i(Qj  -  t  ) 


< 


where  <.v>  is  the  smallest  integer  greater  than  or  equal  to  v,  termed  the  integer  roundup  of  ,v. 
and  where  lx  1  is  the  largest  integer  less  than  or  equal  to  v  termed  the  integer  part  of  v  y ,  is 
termed  the  marginal  capacity  of  an  additional  trunk  at  quality  of  service  p . 


Inequality  (4a)  states  that  when  <m(r  +  )/y/>  number  of  trunks  are  added  to  final  link  ./. 
then  at  least  an  additional  amount  of  traffic  pr+  is  carried.  Inequality  (4b)  states  that  when 
|»»lr")/y/|  number  of  trunks  are  removed  from  the  trunk  group,  then  the  decrease  in  carried 
traffic  is  at  most  pr~ . 


We  assume  throughout  that  each  high  usage  group  /  consists  of  \/  (integer)  number  of 
trunks,  and  that  each  final  group  ./  consists  of  v,  number  of  trunks,  establishing  what  we  term 
the  eMstinn  network.  It  is  further  assumed  that  the  existing  network  can  supply  all  service 
demanded  a'  for  all  pairs  /  and  all  times  of  day  t  with  the  provision  of  a  quality  of  service  p 
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4.3.  A  Nonlinear  Integer  Programming  Formulation  for  the  Network  Hierarch)  of  Section  3 

The  first  lask  is  to  develop  an  expression  lor  the  sum  of  the  trallie  parcels  of  Section  3.2 
oll'ered  to  a  final  link  ./  of  the  esisiui);  neimirk. 

4. a.  /.  Sum  o!  Ovation  Panels  Offered  in  l  inal . I 

Summing  the  mean  overflows  in  (3)  yields  a  lower  bound  for  the  mean  total  overflow 
trallie  parcels  oll'ered  to  trunk  group  ./.  Let  this  sum  be  denoted  by  /.;'!/),  j.c,. 

131  /."(/)—  y.  (pH  lx,.  Cl'  k  o'  )/)  - — 

M  *,l 


for  each  linal  trunk  group  ./.  where  we  define 


IT,  I 
IT, I 


=  0  if  7r,/  =  0.  (  I  his  convention  shall  be 


used  throughout  the  paper.)  Ih us.  a  term  in  the  summation  stemming  from  calling  pair  /  is 
automatically  set  to  0  if  final  link  ./  does  not  belong  to  the  final  routing  of  i.  for  the  case  that 
II1/  is  empty.  < 5 )  automatically  reduces  to  zero,  a  case,  for  example,  which  does  not  occur  in 
figure  I  An  upper  bound  on  the  total  overflow  trallie  to  ./  is  obtained  by  deleting  the  //-term 
in  expression  15). 


4 .  .v  Sum  ni  Panels  Ol  feral  in  f  inal  .1  hum  Ximfiiyh  L  satie  (  alibi, u  Pairs 


lor  any  A  (  II  j  it  follows  analogously  to  (4)  that  the  expected  portion  ol  originating 
trallie  parcel  a,  oll’ered  to  trunk  group  ./  is  ’  .  provided  that  II J  is  nonempty  Summing 

all  these  parcels  of  trallie  yields  a  sum  which  we  denote  /./(/): 

(M  /-/It)  =  L  1 

W  »  •  v 

4.4.4.  I  (  i msirami  mi  the  Sum  of  III  frat/li  Offered  in  f  inal  I 


77  w 
77*./ 


I  he  maximum  total  expected  offered  load  /.,.  which  linal  group  ./  of  the  existing  network 
can  service  at  blocking  probability  1  //  is  the  maximum,  over  all  times  of  clay  /.  of  the  sums 

of  both  types  of  expected  tillered  load  parcels  Accordingly  . 

1 1 1  /.,  =-  max!  /./’</ >  t  /./(/)}. 

Our  modeling  approach  is  basically  an  incremental  one  involving  (i)  modified  offered 

loads  d;  for  all  pairs  /.  (lit  modifications  of  the  number  of  trunks  I . V/  +  A.  and 

(iii)  a  modification  in  the  network  service  quality  f>  Under  these  three  kinds  of  modifications, 
we  may  define  quite  analogously  to  (5)  and  (6)  the  expressions 

l.'/U  I  and  /./if), 
and  analogous  to  (7)  write 
I X I  /.,  -  max!  /.;"l/)  *  /,</>! 


I  he  dtlference  /./  /./  is  the  mean  ot  the  additional  trallie  distribution  on  filial  link  ./and 

so  our  assumption  on  capacities  applies  I  herelore.  il  /.,  /  ,  >  0,  then  by  case  t4a).  only 

I/.,  /,)/>,>  number  ol  trunks  need  he  added  to  final  group  ./.  where  y ,  is  the  marginal 
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capacity  of  an  additional  trunk  at  blocking  probability  1  f>  l  et  )/  denote  the  integer  number 
of  trunks  required  in  group  ./  in  order  to  service  initial  demand  /.,  at  the  new  service  quality  f>. 
Hence,  we  obtain  a  feasibility  requirement  on  the  modified  ./  trunk  group  si/e  \j. 

<9>  E,  y/lv/  >,> 

where  v,  is  integer. 

If  Hj  -  E,  <  0,  then  we  invoke  a  stronger  version  of  the  marginal  capacity  assumption 
regarding  ease  (4b).  We  require  that  r  —  \Ej  -  Ej\.  a  quantity  which  depends  on  the  ij  and 
certain  x,  (high  usage  si/e)  variables,  lie  within  the  0  to  bj  range  required  in  order  for  (4b)  to 
hold  In  other  words,  when  ll/V,  /.,|/y,l  number  of  trunks  are  removed  from  Yj,  the  result¬ 
ing  number  of  trunks, 

A;=  Y,  -  [I/?,  -  Ej\ly  j\ 

mai  he  oll'cred  the  modified  load  at  blocking  probability  (I  p).  It  follows  that  the  same  feasi¬ 
bility  requirement  as  ( *•> )  holds  for  this  case  too. 

The  system  of  inequalities  (9),  one  inequality  for  each  final  group  7,  shall  determine  a  set 
of  constraints  for  the  nonlinear  supply  model,  and  we  shall  write  these  constraints  in  greater 
detail  when  actually  specifying  the  model.  But,  first  we  need  to  take  account  of  the  total 
switched  traffic  in  the  network. 


4.3.4.  Accounting  for  Total  Switched  Traffic 

Let  us  work  with  the  modified  loads  a,',  modified  number  of  trunks  .ij,  and  modified  ser¬ 
vice  quality  p. 


Let  .S’,  denote  the  total  switched  traffic  throughout  the  network  at  time  t.  We  shall  now 
show  that 

W  +  A  W  (_  | )  tt  / 

do)  s,  =  X  X  (o,'»<-M/+ h  J  I  — 

!-  \t  71  lJ 


U'T'K  t-  -"j  11  I  n  U 

+  X  I  We  I  — 

/-  \t  H  \t<  *  <;  \  77  */ 


The  amount  of  overflow  traffic  from  high  usage  calling  pair  /  destined  for  final  J  is 
alBii/.al  +  a1 ,).  However,  before  this  particular  parcel  reaches  J  it  must  be  consecutively 
switched  at  the  point  of  origin,  and  the  (n  ,j  -  1)  points  along  the  alternate  route.  Therefore, 
in  this  case  the  amount  of  switched  traffic  is: 

(II)  a,'fl(.V/.o,'  +</',)  p  J 

The  same  analysis  applies  to  traffic  from  calling  pairs  served  only  by  final  routing.  The 
traffic  switched  due  to  the  originating  load  «*',  A  €  II  j  requiring  final  J  for  completion  is 
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V\ o  now,  sum  (111  over  all  high  usage  pairs  and  then  over  all  finals.  Similarly.  (12)  is 
summed  over  all  remaining  pairs  and  then  over  all  finals,  l  inally.  summing  these  (wo  yields 
I  10) 

-I  x  \  (  us i  -twumpnons  uni!  the  \oniitieur  X/iitlrl 

Analogous  to  l-.isenberg  |5|  and  l.lsner  |bl  we  shall  invoke  simplifying  cost  assumptions 
for  trunks  and  switching.  We  shall  employ  unit  marginal  investment  costs  per  trunk  and  shall 
use  the  same  cost  for  augmenting  a  trunk  group  as  for  diminishing  a  trunk  group.*  We  shall 
denote  the  marginal  cost  per  trunk  for  trunk  group  /.  hy  </  >0,  L  =  I . V/  +  K 

Changes  in  switching  investment  costs  shall  he  approximated  by  using  a  marginal  switch¬ 
ing  investment  cost  c  per  CCS  of  switched  traffic,  as  for  example  in  Lisenberg  [ 5 1 . 

In  the  absence  of  real  data  and  analogous  to  Lisenberg  15]  we  can  merely  set  ct  =  S1000 
for  each  trunk  in  group  /..  final  or  high  usage,  and  also  set  c  =  $62  (per  CCS). 

We  are  now  ready  to  slate  the  basic  nonlinear  programming  supply  model. 

PROGRAM  P:  Assume  an  existing  network  (Section  3)  has  demands  a',  for  all  pairs 


j  =  -.V . -1.  I .  V  integer  group  sizes  v,  for  high  usage  and  final  groups: 

L  =  I . it  +  K,  and  an  overall  network  service  probability  i>  with  marginal  capacities  y,. 

J  =  \1  +  l . M  +  K.  Let  modified  positive  demands  be  denoted  by  a'n  and  let  ft  denote  a 


modified  service  probability  with  marginal  capacity  yj.  J  =  M  +  1 . M  +  K.  Assume  that 

Cj  is  the  cost  per  trunk  on  trunk  group  L,  L  =  I . M  +  A',  and  that  i  denotes  the  switch¬ 

ing  cost  per  CCS.  Let  Ej  be  defined  according  to  <7).  Compute 


1 3a) 

\f  *  A 

\fr  =  min  £  t/  .v/  +  cS 

i  \ 

rom 

among  nonnegative  integers  .v/  for  L=  1.  .. 

.  .  it  +  K.  and  real  S  which  satisfy 

1 13b) 

£  aIHLx, .«/+«', )f>'n  >  "  — 

\t  U  , 

/  X  I) 

~  II  If kJ 

+  L  a[p 

w  <■  ;  $  v  n  u 

-  Ej  <  yjlxj  -  Yj) 

'In  practice,  one  rarelv  lakes  awa\  exixim*!  equipment,  hut  merel>  wails  until  the  normal  growth  in  message  volume 
lakes  up  the  current  slack 
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Observe  that  ihe  system  of  inequalities  (13b)  is  merely  (9)  with  lull  detail  <>l  the  terms  /:) 
showing  the  v,  as  variables.  On  the  other  hand  (13c)  merely  defines  the  maximum  switched 
traffic  in  the  network  according  to  (10). 

It  is  obvious  that  Program  P  is  consistent  because  the  xt  variables  may  be  taken  arbitrarily 
large  as  well  as  the  S  variable.  I’  must  have  a  finite  minimum.  Otherwise,  some  v/  necessarily 
become  arbitrarily  large  and  since  all  cost  coefficients  are  positive,  the  objective  function  would 
arbitrarily  increase  which  is  a  contradiction. 

Program  P  is  a  nonlinear  integer  programming  problem  which  can  be  well  approximated 
for  practical  purposes  by  a  continuous  convex  program.  In  fact,  even  more  can  be  done.  Pro¬ 
gram  P  can  be  approximated  by  a  finite  linear  program  based  on  the  special  convexity  property 
and  monotonicity  property  of  the  Erlang  B-function,  see  Messerli  (13)  We  focus  now  on  how 
the  linear  programming  approximation  is  constructed. 

5.  A  LINEAR  PROGRAMMING  APPROXIMATION  TO  THE  NONLINEAR  PRO¬ 
GRAM  P 

5.1.  The  Convexity  Properties  of  the  Blocking  Probabilities 

In  engineering  practice,  the  definition  of  the  "load  on  last  trunk"  with  respect  to  a  trunk 
group  of  si/e  n  +  1  which  is  offered  the  load  "a "  is  defined  by 

(14)  D(n.a)  =  Bin.a)  -  Bin  +  l.o) 

where  the  Erlang  B-function  is  defined  in  (2),  for  rt  =  0,  1 . with  B(O.a)  =  l.  Observe 

that  Din, a)  >  0  for  each  nonnegative  integer  n.  Messerli  1 13]  gives  a  proof  that  for  any  fixed 
a  >  0,  Din.a)  is  strictly  decreasing  in  the  nonnegalive  integer  variable  n, 

(15)  l)tn  +  I  .a  )  <  Din.a) 
for  o  =  0.  1 .  ... 

lor  V  fixed  define  the  polygonal  function  Bi-.a)  from  the  non-negative  reals  to  the  non- 
negative  reals  by 

( I  b)  B  ( v. a )  =  —I)  ( n, a  ).v  +  in  +  1 )  Bin.a )  -  nB ( //  +  I  ,a  ). 

where  //  is  the  integer  part,  |.v|.  of  v.  Note  that  B</,« )  =  B(r.ti)  for  each  nonnegative  integer 
r 


The  graph  of  the  polygonal  function  B(-.u)  reveals  its  convexity  and  monotonicity  pro¬ 
perties,  which  are  basic  lor  the  construction  of  the  linear  program. 

For  each  nonnegative  integer  n  the  left-hand  side  of  (16)  defines  an  affine  function  on  the 
nonnegative  reals  The  following  cumulative-type  expression  for  this  affine  function  follows 
from  Charnes-Cooper  HI,  pages  352-353. 

h>r  a  fixed  nonnenatiw  m lexer  n 

(17)  —  IHn, a  ).v  +  G;  +  1  >B( n.a  >  —  nBUi  +  1  ,a  )  =  I  +  £  (c,  —  c,  , )(  v  -  r ) 

i  »u 

for  every  real  mmnenaiive  v,  where  c  ,  =  0  and  <>  =  -  Dir, a  )  for  r  =  0.  I . 
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As  strongly  suggested  by  Figure  3,  the  following  proposition  yields  a  uniquely  determined 
system  of  supporting  hyperplanes  for  the  epigraph  K  of  the  function  HI-. a)  The  proposition 
and  its  three  corollaries  shall  be  proved  in  the  Appendix. 


B(0,a)-3(l,a) 


B  ( 1 , a )  -  B  (  2  ,  a) 


I  ic *(  hi  1  The  polygoniil  lunction  determined  b>  the  l  rl;ing 
/Munition  on  nonnejutne  integers 


PROPOSITION  1:  Let  K  be  the  epigraph  of  Hi -.a).  K  -  { ( r..v )  6  DC|.v  ^  0  and 
r  5  /fl.v.r/))  l  et  /.  he  the  set  of  till  (r,.v)  in  IR  which  satisfy  the  semi-infmite  system  of 
linear  inequalities 

1 1 X I  :  ~  \  Z  £<r,  -  c,  |M.\  -  r  ) 

t  -  II 

for  v  0  and  n  =  0.  1 .  2 . 


Then  K  =  /.  and  K  is  nonempty. 

COROLLARY  I  l  et  \  be  nonnegative  real.  The  ( Hix.u  l.v )  satisfies  each  inequality  of 
(18)  strictly  except  for  <i).  the  inequality  indexed  In  |.v|  i.e.,  the  inequality 

:  -  I  ^  (t,  -  c,  |M.v  -  r). 

i  II 

which  it  satisfies  as  an  equality,  and  (ii>  possibly  the  inequality  indexed  by  |.v|  -  I  when 
\  I  T  he  latter  inequality  is  satisfied  as  an  equality  if  and  only  if  v  is  a  positive  integer. 

COROLLARY  2  l  et  I  be  a  positive  integer  and  set  K  =  K  D  |(r..v)|0  ^  v  <  I  I.  Let 
/.  be  the  set  of  all  (r.v)  which  satisfy 


(  OMMl  Nl<  AllONS  Ml  WORKS  Will  I  IIMI  VAKYINt.  I  )l  \1  AMDS 


|s 

;  -  I  ^  £(<,  -  <,  i ) ( . v  -  /  ).  a  ^  0 

/  0 

for  n  =  0.  I .  I  I.  I  hcn  A  = 

C'OROl  [ARY  3:  (z..v)  €  A  is  an  extreme  point  of  K  if  and  only  if  a  is  a  nonnegative 
integer  and  :  =  Hix.a  ). 

In  view  of  f  igure  3.  which  reflects  the  basic  integer  convexity  property  (15).  these  results 
are  intuitively  clear.  They  tire  formally  proved  in  the  Appendix 

5.2.  The  Key  Approximation  and  the  Linear  Program 

We  now  replace  in  Program  P  the  0-function  by  the  polygonal  0-function.  and  the 
integrality  conditions  on  the  xt  variables  are  removed,  finally,  upper  bounding  constraints 
V/  ^  I  /  are  imposed,  where  the  l)  are  large  positive  integers. 

The  next  step  replaces  each  term  d,'0(.x/,a,' +  a'.,)  in  (13b)  and  (13c)  with  the  new  vari¬ 
able  and  requires  that 

alB(x/,a +  a'.,)  ^  z,'. 

The  new  approximation  program  so  obtained,  denoted  P\  is  the  following. 

PROGRAM  P':  Same  assumptions  set  as  in  P.  Let  l)  be  large  positive  integers  for  high 
usage  links  Compute 

M-K 

(19a)  V/(,  =  min  £  c/.V/  +  c.V 

i  - 1 

from  among  reals  xt .  and  V  which  satisfy: 

(|9b)  A /(/)  ^  a>.  where  .V; ( / )  = 


I  :r. . 

Hjl 

trr .  .  1  i 

+  I  m  r>i 

ttu 

-  Ki  +  yjXj 

-  \1 

/  XU 

it  ij 

\i<  k  <  \ 

ttu 

for  each  final  J  and  each  /,  and 

(19c)  Sit )  ^  S.  where  Sit)  = 


WfA 

v 

V  t  n 

I  1  zip  J  I 

n,J 

M  +  K  l7r  i, 

,  v  v  (  =  '  n  u  1  1 

ttu 

+  2,  i  ak  p  ) 

J-K1  +  1 

/-  M 

n  iJ 

J-  Vf *  1  M<|K|<.V 

tt  kJ 

1*0 


for  each  /.  and 

(I9d)  a,  kixhal  +  a' ,)  ^  z,' 

for  each  high  usage  calling  pair  /  and  lime  r,  and 

(19c)  0  5?  i,  ^  l, 


for  each  high  usage  link  /. 


lb 
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it  is  obvious  now  in  view  of  Corollary  2  that  P'  is  equivalent  to  the  finite  linear  program 
denoted  LP’.  obtained  by  replacing  (I9d)  with  the  finite  system  of  linear  inequalities 

1 20)  a,'  Dir  .a,'  +  a'  ,).i/  +  z,'  ^  a,'(i >  +  1 ) fl(i>  Ml  +  a' ,) 

—  a  l  rBlr  +  I  M',  +  a  , ) 

tor  r  =  0,  1 .  1/  -  1.  and  each  high  usage  pair  /,  and  each  i.  It  is  equally  obvious  that 

Program  I  P'  is  consistent  and  has  a  finite  minimum  since  the  x,  variables  are  bounded  and  all 
cost  coefficients  are  positive.  Hence  P'  itself  has  optimal  solutions. 

VNe  now  use  Corollary  I  of  Proposition  I  to  discuss  the  cost  effects  due  to  using  an 
optimal  solution  of  P'  as  a  solution  to  the  integer  program  P.  If  high  usage  size  xt  is  not  tin 
integer,  then  «'  +  «',),  <xl>)  is  in  the  epigraph  of  Bi-.a',  +  a' ,)  for  each  t . 

where  <\’>  is  the  integer  roundup.  The  roundup  introduces  an  increase  in  the  total  cost 

associated  with  high  usage  group  /,  < < -V/*>  —  v/hy,  where  0  $5  <.v;>  —  ,v/  <  1.  An 

offsetting  cost  effect  from  final  groups  J  and  switching  S  occurs  because  from  the  monotonicity 
of  tU  Mi  *■  a  ).  each  c;  does  not  increase. 

finally,  in  order  to  insure  quality  of  service  />.  nonintegcr  final  group  sizes  .\j  should  be 
rounded  up,  thereby  increasing  total  costs.  Numerical  estimates  of  these  various  off  setting  cost 
effects  due  to  round  up  of  trunk  group  sizes  determined  by  Program  P’  have  not  been  obtained. 
It  appears  to  us  that  such  estimates  must  stem  from  numerical  experiments  on  field  data.  Cer¬ 
tainly.  as  strongly  suggested  by  f  igure  3  and  Proposition  I  and  its  Corollaries,  integer  program¬ 
ming  pathologies  from  straightforward  rounding  processes  do  not  occur. 

Because  of  the  linear  inequality  system  (20),  Program  LP'  may  be  quite  large  and  for 
practical  purposes  it  would  be  useful  to  obtain  art  equivalent  smaller  problem  in  place  of  LP'. 
The  monotonicity  of  the  /^function,  essentially  Corollary  1  of  Proposition  I.  suggests  a  useful 
procedure. 

5.3.  Solving  the  linear  Program  I.P'  Through  Bounded  V  ariable  Reductions 

Let  LP'hi)  be  the  bounded  variable  version  of  L.P'  obtained  by  replacing  1 19el  with 
<  I  9e'>  or  /  ^  V/  ^  ft  i 

for  each  high  usage  group,  and  in  (20)  restrict  r  to:  >■  =  o/ . fii  1  where  o/  and  /i,  are 

nonnegative  integers  such  that  0/  -  I  -  ^  2.  The  following  is  proved  in  the  Appendix 

PROPOSITION  2:  Under  the  above  bounded  variable  assumptions: 

(i)  any  optimal  solution  if.V/  ).  S’.  (-')]  of  LP'm,  is  feasible  lor  I  P',  and 

lii>  if  for  each  high  usage  group  / 

•  21)  or  /  <  V/  <  B  i . 


then  this  optimal  solution  is  also  optimal  for  Program  LP'  Moreover,  there  exist  <»/.  /j/  and  an 
optimal  solution  of  l.P'  such  that  with  respect  to  \/  of  that  solution,  <21 )  holds. 


) 
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6.  UOMPITKR  PROGRAM  AND  RKSl  I  TS 
6.1.  Implementation  of  tlit'  Model 

l  or  all  but  the  simplest  network  hierarchies,  the  large  si/.e  of  LP'  of  Section  5.2  warrants 
the  use  of  the  hounded  variable  reduction  program  LP'l(1)  Thus,  according  to  Proposition  2  in 
Section  5.5,  we  may.  in  general,  restrict  \y  in  (19e')  to  a  range  of  4  integers,  that  is. 
Bi  o /  =  5  for  each  high  usage  link  /.  This  in  turn  restricts  v  to  a  range  of  3  integers  in  (20). 
We  shall  also  specify  that  there  are  a  finite  number,  T,  of  periods  (hours)  during  the  calling 
day.  for  1-P’hi>.  the  variables  and  constraints  which  occur  are  accounted  for  in  Table  2. 


T.ABLf,  2.  The  I  arid  hi  vs  and  Constraints  of  Program  LP'm, 


Name 

Number 

Total 

Variable 

__ 

High  Usage  X/ 

Pinal  Xj 

Switch  S 

Overflow  z\ 

M 

K 

1 

2m  x  T 

2M  x  T  +  K  +  M  +  1 

(19b) 

K  x  T 

Constraint 

( 19c) 

(20) 

T 

2M  x  3  x  T 

T  *  (K  +  6M  +  1)  +  2M 

( 1 9e' ) 

2  M 

f  or  example,  in  figure  1.  A  =7.  M  =  8.  and  taking  T  =  3  we  have  a  total  of  64  variables  and 
1 84  constraints. 

Let  -If  <  h  denote  the  constraints  system  of  LP'm).  where  £  =  [(.\y  ).S.  (r/)]'  It  is  easy 
to  see  that  -t  is  a  sparse  matrix;  indeed,  roughly  98"/'  of  its  entries  are  zeros.  Thus  it  requires 
some  attention  to  enter  each  of  these  into  its  proper  row  and  column,  figure  4  is  a  "blueprint" 
for  the  matrix  f 


Calculating  the  entries  of  A  requires  computation  of  the  erlang  5-formula,  (2)  at  integer 
values.  However,  the  factorial  terms  involved  quickly  become  loo  large  for  direct  computation. 
Given  some  positive  offered  load  a  and  positive  number  of  trunks  n,  the  following  recursion  is 
used: 


Bin, a ) 


aB(n  —  1  ,a  ) 
n  +  aB  in  —  I  ,a  ) 


;  B(0.a  )  =  1. 


The  "load  on  the  last  trunk"  Din, a)  which  also  appears  in  (20)  merely  requires  computation  of 
Bin. a)  and  Bin  +  l, a). 


We  need  data  on  both  the  existing  network  and  the  modified  network.  As  a  simplifying 
assumption  let  us  lake  p  -  p,  meaning  the  quality  of  service  is  to  be  maintained  at  the  same 
level.  The  necessary  data  then  consist  of  a'  and  a'  for  each  calling  pair  j  and  for  each  time  f, 
p.  yj  and  y  for  each  final  J,  and  the  sizes  of  the  links  on  the  existing  network.  xr  for  each 
link  L,  Observe  that  since  p  =  p.  Yj  —  xj  for  each  final  J.  see  Section  4.3.3.  The  hierarchy 
matrix  introduced  in  Section  3  1  contains  all  the  necessary  information  about  final  routing. 


s 
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SUM: 


T  X(6M+K+1)  +  2M 


^  :  High  Usage  Group 
:  Final  Group 
g  :  Switch 
0  :  Overflow 


I  k,i  ki  4.  Siructurc  of  constrain!  4-mairix 


xi  xixi  xi  xi  x; 
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With  matrix  I  arranged  ax  in  Figure  4,  the  entries  can  be  managed  easily  Nek  dial  the 
block  of  'he  tirxi  \f  4  A'  *  1  columns  repeals  itself  lor  each  time  period  whi'c  the  nerflow 
block  shills  2\l  columns  for  each  consecutive  time  period  For  t  —  I.  the  First  <iA  1  t  A  s-  1 
rows  of  I  can  he  tilled  bv  the  following  piece  of  computer  program  (Refer  to  Figure  '  ■ 


I,  •  i  '  Hiock  ol  I  ’taii.x  corresponding  to  l  -  t  with  variable  in  position  of  entries  Muck  :  <•...«  ,.r 
/  ..  o.ihlcs  mil,  nuii/cni  loelliueiils  Uvler mini’ll  bv  permissible  overflows  according  in  network  hier.,  u. 


z' 

*-M| 

zk 

zL 


-71 

ih 


20 
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Rows  corresponding  to  (20): 

FOR  /:=  I  SHIP  I  UNTIL  M  1)0 
FOR  /:  =  1  SUP  I  UNTIL  3  1)0 
BEGIN 

4  [6 * ( /  -  I)  +  J.l)  -  - a}D(<x ,  +  J  -  \.a,'  +  a1,); 

4  [6  *(/  —  I)  +  7  +  3./1:-  -a\,D(a ,  +  J  -  l.a,'  +  «',); 

/4  [6*<  /  —  I)  +  y,  2M  +  A  +  I  +  /):= 

j4  [6  *  ( /  —  1)  +  y  +  3.  2A/  +  Af  +  2  -  /]:-  -1 

END; 

Rows  corresponding  to  (19b)  and  block  4(>  of  Figure  5; 

FOR .  /  =  M  +  1  STEP  1  UNTIL  M  +  K  DO  4  [5  A/  +  /,/]:—  -y,; 

FOR  /:=  I  STEP  I  UNTIL  M  DO 

FOR  /  =  M  +  1  STEP  1  UNTIL  M  +  K  DO 

BEGIN 

/I  [5  A/  A-  J.l  M  +  K  +  2-  /]:=  IF  ir  =  0  THEN  0 
ELSE  p u 

-4  ISA/  +  y.2W  +  a:  +  I  +  /].=  IF7TW-  OTHEN  0 
ELSE  p'n  "  " 

END; 

The  row  corresponding  to  (19c); 

•1 16  \f  +  K  +  |,  M  +  K  +  1|:=  -1; 

FOR  /:=  1  STEP  I  UNTIL  M  DO 

FOR  ./:=  M  +  1  STEP  1  UNTIL  M  +  K  L)0 

BEGIN 

A  |6 M  +  K  +  1. 2A/  +  A.’  +  2  -  /):=  IF  tt  „  =  0 
THEN  -I  |6 M  +  K  +  \.2M  +  K  +  2  -  /] 

ELSE  .4  (6A/  +  K  +  1.2A/  +  K  +  2  -  /I  +p'" 

A  [6 A/  +  Af  +  1 , 2  A/  +  Af  +  1  +  /!:»  IF  tt  w  -  0 
THEN  4  |6 A/  +  At  +  1.2A/  +  Af  +  1  +  /] 

ELSE  4  [6  A/  +  K  +  \.2M  +  K  +  1  +  /]  +p,,r"~" 

END; 

Computing  the  right-hand  side  of  the  constraint  system  is  straightforward,  although  (19b) 
involves  Ej.  the  maximum  load  offered  to  final  group  J  during  the  calling  day,  and  it  is  the  sum 
of  many  terms. 
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6.2.  Numerical  Kxamples 

The  model  was  implemented  on  two  tesi  examples,  and  the  resulting  hounded  variable 
linear  programs  were  solved  at  the  Carnegic-Mellon  University  Compulation  Center  on  a 
OTC-20  machine  using  single-precision  arithmetic. 

ft.J.I.  first  Example:  I  Setwork  Basal  on  California  f  ield  Data 

We  apply  Program  P  of  Section  4.3.5  to  the  network  given  in  la  sen  berg  ( 5 1  and  l.lsner 
|b|.  which  in  turn  is  based  on  Ciardena.  California  held  data.  The  hierarchical  structure  of  the 
network  is  given  in  f  igure  6  below 

In  this  network  there  is  only  one  originating  office  labelled  and  43  terminating  offices, 
labelled  p\  through  Pi\.  This  means  that  there  is  a  demand  for  traffic  associated  with  each  of 

the  43  calling  pairs  </>n./>, >.  i  =  1 . 43.  All  other  ordered  pairs  of  points  are  ignored. 

livery  calling  pair  is  also  a  high  usage  calling  pair.  The  high  usage  links  are  labelled  by  the 
integers  1  through  43.  and  the  finals  by  44  through  87.  Finals  45  through  87  which  connect 
office  pi 4,  the  tandem  switch,  with  each  terminating  office,  are  referred  to  as  tandem  completion 
groups.  The  overflow  hierarchy  is  indicated  in  Figure  6. 


The  hierarchy  matrix,  which  has  43  rows  (one  for  each  calling  pair)  and  44  columns  (one 
for  each  final),  consists  only  of  a  single  column  of  l’s  (for  final  44)  next  to  a  square  (43  x  43) 
block  with  2"s  along  the  diagonal,  and  0's  elsewhere.  In  fact,  this  hierarchy  is  so  simple  that 
the  matrix  itself  need  not  be  stored,  since  several  statements  written  in  a  computer  code  can 
determine  the  entries  of  ( 1 9b)  and  (19c). 

Base  Demand 

We  assume  that  the  network  is  constructed  ab  initio ,  namely  all  the  initial  demands 
between  pairs  of  offices  arc  zero  and  all  initial  trunk  sizes  are  zero.  According  to  (7),  then,  it 
follows  that  /:'/  =  0  for  J  =  44 .  87. 

Incremental  Demand 

Positive  incremental  demands  r;,'  in  CCS  for  each  calling  pair  /  and  t  are  given  in  columns 
2  and  3  of  fable  3  below  As  in  ( 5 1  and  |6|,  we  take  a  marginal  capacity  of  30  CCS  for  all  final 
groups  and  neglect  blocking  probability  on  the  final  link  44  Similarly,  unit  costs  are  S1000  per 
trunk  and  %6 2  switching  cost  per  CCS  incurred  only  at  the  tandem  switch.  With  these 
specifications  Program  P  of  Section  4.3.5  becomes  the  following  one. 
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subject  to 

4  > 

]jTu.'  IU\,.d')  ^  30v44  tor  /  =  I.  2 

-  - 1 

d  X  .<>  )  <  3I).\i44.;,  |or  ,  =  | . 41 

for  z  =  1 .  2 


and 

44 

£«/  #<.v, .«?•'»  <  .V  for  t  =-•  1.  2 

i- 1 

where  ihe  v,  are  all  nonnegative  integers 

The  above  nonlinear  integer  program  was  approximated  by  the  linear  program  derived  by 
the  methods  of  Section  5  2.  which  was  then  solved  using  suitable  bounded  variable  reductions 
based  on  Section  5  3  The  bounds  of  the  high  usage  group  sizes  were  chosen  by  our  prior 
knowledge  of  Fisenherg  s  |5j  and  Flsner's  [b|  solutions.  An  optimal  linear  programming  solu¬ 
tion  so  obtained  is  termed  the  incremented  network  Table  3  presents  an  incremented  network 
and  includes  the  overflows  from  the  high  usage  trunk  groups  to  the  final  trunk  group  44 

fable  4  compares  the  sizes  of  'he  high  usage  trunk  groups  occurring  in  our  incremented 
network  with  those  computed  in  l.isenberg  |5)  and  those  computed  in  1  Isner  lb).  Finally. 
Table  '  gives  some  overall  comparisons  among  the  three  solutions 


Ri  marks  “•!  fables  J.  4.  and  • 

In  Tables  3  and  4  each  ime.tr  programming-determined  high  usage  group  size  v.  except 
#43.  satisfies  either  (a).  <  v  »  v  <  10  "  or  (b>,  a  -  lv]  <  10  and  hence  an  integer  is 
reported  High  usage  group  #43  is  truncated  to  3  decimal  places  as  are  all  overflows,  the  final 
group  size,  and  tandem  completing  group  sizes. 

Hisenberg’s  multihour  noninteger  solution  is  not  given  in  [51,  and  consequently  the  costs 
in  Table  5  may  be  higher  than  for  the  noninteger  solution. 

Flsner's  descent  algorithm  obtains  a  solution  with  a  lower  total  cost  than  an  integerized 
solution  The  use  of  an  approximation  to  the  Erlang  0-function  (2)  applicable  to  noninteger 
high  usage  trunk  group  sizes  may  account  for  this  difference. 

6  2. 2.  The  Sa  ond  Example:  Figure  I 's  Network  Hierarchy 

We  solve  Program  1  P'  of  Section  5.2  applied  to  the  network  hierarchy  of  Figure  1  of  Sec¬ 
tion  I  with  the  following  specification  of  input  data 

Hase  Demand 

Traffic  demand  is  assigned  to  all  56  pairs  of  points  of  f  igure  I  by  daytime,  evening,  and 
nighttime  according  to  three  basic  kinds  of  pairs: 


(  I  IMMl  \l<  \  I  lo\S  \1  |  Wi  >KkS  W|  1 1|  1 1  Ml  \  \K  Yl\(.  DIMWDS 


TABLE  3— Specification  of  Incremented  Offered  Load  Demands  far 
Example  I  and  an  Optimal  Linear  Programming  Solution 
with  all  Overflows  from  High  U suite  Groups 


—  — r 

[  runk 
(iroup 

Offered  Loads 
(CCS) 

Overflow 

(CCS) 

High 
l  sage 

Tandem- 

Completing 

Hour  1 

Hour  2 

Hour  1 

Hour  2 

I  runks 

Trunks 

1 

60 

140 

3.746 

41  478 

4 

1.399 

2 

1  14 

4 

16.271 

0  000 

5 

0.542 

3 

82 

20 

10.260 

0.045 

4 

0.342 

4 

305 

76 

20.002 

0.000 

12 

0.666 

5  | 

30 

0 

13.636 

0.000 

1 

0.454 

6  1 

54 

7 

4.174 

0.007 

3 

0.305 

n 

102 

56 

4.745 

0.401 

5 

0.326 

8 

256 

161 

21  305 

1  632 

10 

0.710 

4 

366 

230 

22.406 

0.838 

14 

0.746 

III 

464 

310 

20.256 

0.548 

18 

0.675 

1  1 

115 

115 

14  595 

14.545 

5 

0.486 

12 

144 

34 

16.871 

0.013 

6 

0.562 

13 

206 

335 

3.641 

44.757 

11 

1.491 

14 

310 

650 

0.270 

89  440 

19 

2  983 

15  1 

284 

314 

13.718 

24.987 

12 

0.832 

16  1 

43 

152 

7  072 

33.258 

5 

1.108 

17 

17 

24 

5.452 

9  599 

1 

0.319 

18 

74 

325 

0.017 

73.351 

9 

2.445 

14 

102 

158 

4.424 

23.041 

6 

0.768 

20 

137 

322 

1.414 

71.323 

9 

2.377 

21 

222 

247 

10.744 

18.096 

10 

0.601 

22 

252 

340 

3.62! 

43  919 

13 

1.463 

23 

445 

144 

21  335 

0.006 

17 

0.711 

24 

176 

86 

14.441 

0.697 

7 

0.666 

25 

83 

24 

10.640 

0.227 

4 

0.354 

26 

48 

21 

17.146 

0.056 

4 

0.571 

27 

158 

74 

13.236 

0.291 

7 

0  441 

28 

124 

36 

18.441 

0.110 

5 

0.616 

24 

54 

25 

7.253 

0.700 

3 

0.241 

30 

38 

1 

8.102 

0.000 

2 

0.270 

31 

31 

17 

5.149 

1.196 

2 

0.171 

32 

140 

46 

15.286 

0.077 

6 

0.509 

33 

46 

30 

16.145 

0.262 

4 

0.539 

34 

122 

62 

17.587 

1  410 

5 

0.586 

35 

163 

57 

14.462 

0.057 

7 

0.498 

36 

163 

72 

14.462 

0.247 

7 

0.498 

37 

246 

238 

17.134 

4.745 

12 

0.571 

38 

33 

28 

5.933 

4  071 

2 

0.197 

34 

240 

3 

1 5.806 

0.000 

10 

0.526 

40 

136 

7 

13.783 

0000 

6 

0.459 

41 

54 

4 

7.253 

0.000 

3 

0.241 

42 

52 

35 

6.546 

2.063 

3 

0.218 

43 

206 

4 

13.108 

0.000 

8.997 

0.436 

Totals 

of 

Columns 

6712 

L_  _ 

5154 

508.643 

508.643 

306.997 

30  921 
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I'ABLH  4.  Comparison  of  Optimal  I  list  1 1  Usafte  hunk  Croup  Sizes 
Computed  b\  the  Multilwur  Method.  I  Descent  Method, 
and  Linear  I'roprammint;  for  tlw  Gardena  Network 


High  Usage  Group  Sizes 

Irunk 

Group 

From  Multihour 
Method  [51 

From  Descent 
Method  |6| 

From  Linear 
Programming 

! 

4 

4.42 

4 

2 

3 

5.25 

5 

3 

4 

3.78 

4 

4 

6 

1 1  97 

12 

5 

0 

1.47 

1 

6 

1 

2.81 

3 

7 

4 

4.64 

5 

8 

8 

10.32 

10 

4 

12 

14.10 

14 

10 

18 

17.57 

18 

1  I 

5 

5.37 

5 

12 

7 

6.20 

6 

13 

10 

10.81 

11 

14 

16 

18.58 

19 

15 

12 

12.14 

12 

16 

5 

5.43 

5 

17 

1 

1.08 

1 

IX 

6 

8.92 

9 

19 

5 

5.74 

6 

20 

8 

9.36 

9 

21 

10 

9.78 

10 

22 

12 

12.58 

13 

23 

17 

16.73 

17 

24 

8 

7.41 

7 

25 

4 

3.83 

4 

26 

5 

4.43 

4 

27 

7 

6.75 

7 

2X 

6 

5.44 

5 

29 

3 

2.64 

3 

30 

2 

1.86 

2 

31 

2 

1.60 

2 

32 

6 

6.06 

6 

37 

5 

4.35 

4 

34 

6 

5.40 

5 

35 

7 

6.92 

7 

36 

7 

6.93 

7 

37 

12 

1  1.84 

12 

38 

2 

1.77 

2 

39 

10 

9.70 

10 

40 

6 

5.90 

6 

41 

3 

2.59 

3 

42 

3 

2.61 

3 

43 

9 

8.48 

8.997 

Toluls 

287 

305.56 

306  997 
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I  ABIT  5.  (  ompansons  oj  Total  Xumher  o/  Trunks.  Sum  hint: 
Costs,  a  tul  Total  Costs  /or  tlw  Multi  hour.  Descent, 
ami  I. incur  Troxramminy  Solutions  oj  the  Gardena  Xetwork 


Network 

Characteristics 

Multihour 

(51 

Descent 

[6] 

Linerar 

Programming 

#  High  Usage  Trunks 

287 

305.56 

306.997 

#  Final  Trunks 

39 

NA* 

16.955 

#  Tandem  Compl. 

NA 

NA 

30.921 

Switching  Cost 

$44,640 

NA 

S3 1,537 

Total  Cost 

$405,315 

$385,500 

$386,410 

"NA  =  nol  available 


M)  each  of  the  pairs  (1.0  and  (C.  I )  receive  500  CCS  during  daytime  and  0  during  the 
other  two  periods. 

(2)  each  pair  which  includes  exactly  one  of  the  pairs  I  or  C  receives  100  CCS  during  day¬ 
time  and  0  during  the  other  two  periods,  and 

(31  each  pair  which  excludes  both  points  A  and  C  receives  75  CCS  during  daytime.  200 
CCS  during  evening,  and  100  CCS  during  nighttime. 

These  choices  were  imagined  upon  viewing  points  A  and  C  as  "commercial"  points  and 
viewing  all  other  points  as  "residential."  They  represent  particular  choices  of  the  inputs  a). 

i  =  1 . 56.  of  Program  LP’.  Analogous  to  the  first  example  we  assume  that  the  cost  per 

trunk  is  SI 000.  that  the  switching  cost  is  $62  per  CCS,  the  quality  of  service  is  0.99,  and  that 
the  marginal  capacity  of  a  trunk  in  a  final  group  is  30  CCS.  However,  we  did  not  neglect  block¬ 
ing  on  the  final  links.  Using  these  inputs  and  the  hierarchy  of  Figure  1.  an  optimal  solution  to 
LP'  was  obtained  termed  the  base  network. 


Incremented  Demand 

Assume  that  an  increase  in  demand  of  20%  occurs  uniformly  among  all  of  the  56  calling 
pairs.  With  all  other  inputs  to  LP'  remaining  unchanged  an  optimal  solution  was  obtained, 
termed  (as  before)  the  incremented  network. 

Moreover,  Program  LP'  was  solved  under  three  additional  restrictions  on  the  time  t. 
namely,  all  high  usage  links  be  sized  according  to:  (a)  daytime  loads,  (b)  evening  loads,  and 
(c)  nighttime  loads,  respectively.  These  restricted  solutions  result  from  the  requirement  that 
the  network  be  "engineered"  according  to  a  fixed  single  hour,  respectively.  This  is  in  contrast 
to  the  multihour  solutions  of  the  base  and  incremented  networks,  and  provides  a  test  of  reason¬ 
ableness  of  the  multihour  solutions. 

For  purposes  of  computer  usage,  the  size  of  LP’  was  reduced  by  the  bounded  variable  res¬ 
trictions  of  Proposition  2  of  Section  5.3.  For  example,  setting  the  V,  bounds  in  (19e)  at  25  for 
each  high  usage  group  yields  a  64  variable  linear  program  with  1240  constraints.  This  program 
was  solved  by  solving  a  finite  sequence  of  much  smaller  bounded  variable  programs  (64  vari¬ 
ables,  184  constraints).  The  results  are  given  in  Table  6  below. 
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I  Mil  I  (i  (  oni/naei  Results  ol  lour  Solutions  oj  I’roxrant  I  P  , 
S  ci  non  .■'..V  Hit  sc  and  Incremented  \ctnorks.  and 
\cnu»k  Smith  llom  Designs.  Hose  Demand  hurt  mettled 
So  I  ntlannh.  SI  mm  (  i  isl/lrunk.  $  (iS  Sit  itchtnx  (  «'<:/(.  (  S 
and  11  44  (Judith  ol  Sen ue 


1  mai  1  inks 

and 

Base 

Incremented 

1 

l. 

Single  1  lour  1  Jesigns 

integer  index 

Network 

Network 

4- 

Day  time 

1  veiling  j 

Nighttime 

,  \B 

4 

24.8.h) 

25  1)05 

i 

67  425 

43  703 

A  414 

AC 

10 

8~.  144 

104  642 

104  2 1 4 

1 1  1  244 

1  1  1.400 

Al) 

1  1 

52.7(>0 

62.467 

i 

64.622 

63.334 

60084 

HI 

12 

14.224 

17  041 

i 

24.272 

21  436 

31.838 

(  I 

13 

40  044 

48.000 

54  514 

47.871 

55.343 

C  (. 

14 

40  1)44 

48  000 

54.514 

47. 87 1 

ss  343 

;  (II  i 

15  ! 

52.824 

62  416 

i 

68  77> 

63  416 

(D  143 

High  l  sage 

Links 

and  Integer  Index 

AL 

1  | 

1 

1 

, 

1  1 

’> 

n 

m 

A 

17 

20 

0 

21 

) 

1  * 

IK,  , 

17 

2n 

i 

4 

21 

12 

Cl 

4 

“* 

N 

12 

0 

n 

I>! 

N 

18 

20 

1 

i 

4 

21 

12 

H  i 

()  , 

18 

21 

i 

10 

21 

12 

l  (i 

~ 

18 

21 

It) 

21 

12 

LH 

8  I 

21 

i 

10 

21 

12 

I  otal 

1  Switched  | 

1  Trallic 

t 

8442  02 

10172.11 

! 

| 

13764.04 

10138  47 

I  2441.15 

(CCS)  1 

' 

T 

1 

Total 

r  ‘  ‘  ' 

Cost  | 

S'.1 5'  4 

SI  143.2 

i 

S1402.3 

SI  154.0 

SI24(l. (1 

i  (1)00) 

i 

i  _ 

, 

.  _  -J- 

Observe  that  ihe  multihour  (incremented  network)  solution  has  a  lotal  cost  which  is  less 
than  each  of  the  single  hour  design  total  costs,  although  the  single  evening  hour  solution  is 
only  44%  larger  than  the  multihour  solution  Apparently,  the  opportunity  of  engineering  final 
groups  -ffland  4(’at  another  time,  namely  davnme,  permits  a  slight  sjving  in  total  cost 

7.  CONCH  SIONS 

In  t ins  paper  a  is  recommended  that  linear  programming  be  used  to  solve  for  changes  in 
trunk  group  and  switching  equipment  requirements  necessary  to  provide  for  altered  demands 
lot  telecommunications  services  and  altered  demands  for  service  qualities.  Obtaining  solutions 
!<•  this  basic  problem  is  a  major  goal  of  our  supply  model  which  seeks  to  minimize  total  incre¬ 
mental  investments  in  both  trunking  and  switching  subject  to  these  constraints, 

Ihe  linear  programming  model  distinguishes  high  usage  trunk  groups  from  final  trunk 
groups  according  to  the  role  each  plavs  m  the  network  hierarchy.  The  Important  subset  of  high 
usage  group  variables  may  be  solved  for  bv  linear  programming,  and,  in  general,  (he  crisis  due 
to  straightforward  integer  rounding  ol  these  groups  tend  to  be  offsetting  and  integer  round-off 
procedures  easily  maintain  overall  network  quality  of  service 
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(  aution  must  ho  exercised.  however,  in  the  selection  of  die  '•i/es  ol  1  lit-  final  trunk  groups 
bee. i use  ol  the  use  ol  the  maigmal  capacity  assumption  in  the  linear  programming  model  In 
practice,  the  actual  values  ol  the  final  group  variables  can  be  determine  bv  methods  which  do 
not  depend  on  the  marginal  capacity  assumption,  principally  Wilkinson's  Pquiv.ikni  Random 
Method  121.  |?tl|  Ibis  method  is  needevl  because  ol  the  various  peakedness  ell eels  li.ii  on.ii: 
in  the  probabihtv  distributions  ol  alternately  muled  traflic.  see  also  Dcsclwmps  ! 4 1 

I  he  question  ol  whether  the  linear  program  P’  provides  optimal  soli/ cos  an,  mg  micgrai 
numbers  ot  high  usage  trunk  group  sizes  is  still  an  open  one.  A  related  ciass  ol  nonlmeui 
integer  programs  which  are  solvable  as  linear  programs  is  treated  in  Meyer  |I4|.  where  various 
unimodularity  assumptions  are  made  These  assumptions  do  not  apply  in  general  to  the  class  of 
network  problems  treated  in  this  paper  Ihe  results  of  our  linear  programming  experiments  on 
two  simple  networks  m  the  field  may  stimulate  research  on  this  question 

We  shall  leave  the  linear  programming  duality  developments  for  a  later  paper  It  appears 
that  sensitivity  and  postoptimality  analyses  will  he  indeed  useful  for  network  design  synthesis 
fortunately,  hv  Proposition  I  and  its  corollaries  it  appears  that  a  much  smaller  list  of  active  dual 
variables  will  be  required  than  the  total  number  of  constraints  in  program  I.P' 

I  uture  work  should  also  incorporate  more  than  one  alternate  route  in  the  network  hierar¬ 
chy.  even  though  for  many  networks  in  ihe  held  ihe  first  and  second  choice  routes  are  preem¬ 
inent  Mi  my  networks  given  in  the  literature  are  included  within  the  linear  pi otr miming 
models  of  this  paper  l  arge  scale  network  optimizations  made  available  'lii<  i;  •  -  die  modeling 
ap'-roach  ot  dus  nap  r  si-ould  enhance  an  effective  integration  ol  the  -supply  model  wun  disag- 
c.’o’  i ted  ceoi  onviru  demand  model  for  telecommunications  services. 

W  ,  on,  hide  w  ill  an  observation  shared  bv  f'.dward  \  Sever  and  Stephen  \  Smith, 
exis.ssed  in  personal  correspondence,  that  there  is  an  interesting  equivalence  between  tele¬ 
phone  emtuieei  me  ,md  replenishment  inventory  systems,  see  I  ltd  and  ll’l.  Perhaps  the  design 
ot  more  complex  telecommunications  network  hierarchies  may  have  application  to  the  design  of 
more  complex  replenishment  inventory  systems. 
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Appendix 

PROOFS  OF  PROPOSITION  I.  THRKK  COROI.I. ARIFS 
OF  PROPOSITION  I.  AND  PROPOSITION  2 

PROPOSITION  1  l  el  A  he  defined  as. 

A  =  |(;.  v)  '  IR  l  \  >  0  and  r  ^  /# (_v.r ) I . 

Lei  /  he  the  sei  of  nil  C.a  I  in  IR-  which  satisfy  the  semi-infinite  system  of  linear  inequalities 
( I  i  :  I  U.  ~  t.  i  > ( v  —  /  >  and  v  ^  0 

for  n  ~  If.  1 . 

Then  A  =  /..  and  A  is  nonempty. 

I’ROOI  Nonemptiness  of  A  is  most  easily  seen  by  observing  that  (1.0)  €  A  since 
ti  (0.,/ )  -  H  (0. u  I  -  I 

1  et  (;.\  i  he  an  arbitrary  point  in  A  Assume  throughout  that  //  =  (vj.  the  integer  part  of 
v  Applying  ilh>  ol  Section  5  I  gi\es 

;  >  /)(//.</ 1  \  -*-<//-*-  l)/i  (//.</•  -  iiB  ( ft  +  l.cv). 

and  hence  from  (P)  we  base 

Cl  :  1  >  £<c.  ,M.\  -  /  ). 

Thus.  < r.  \  l  satisfies  the  particular  inequality  of  <1 1  indexed  by  the  nonnegative  integer  ft. 
Consider  now  any  integer  n.n  >  n  +  1  and  write 

B(\.o>  -  1  +  A"  =  £(c,  —  c,  , )  (  x  -  r) 

r-ll 

where  A,"=  £  (c,  —  c,  t  )  lx  —  r).  Now  for  any  integer  r.  n  +  1  ^  r  ^  n,  it  follows  that 

x  -  r  <  0  because  n  ^  \  <  n  +  l  ^  r.  In  addition,  c,  —  c,  i  >  0  for  each  nonnegative 
integer  r,  and  therefore  Af  <  0  for  each  integer  n.  n  5s  n  +  1.  Hence, 

(3)  :  ~  \  2  Bix.a)  -  1  >  B(x.a)  -  1  +  A,"  =  £(c,  -  c,  ,)  (x  -  r ), 

i-^O 

for  each  integer  n.  n  ^  n  +  1. 

(2)  and  (3)  together  show  that  f-.v)  satisfies  all  those  inequalities  of  (1)  indexed  by 
ii.  n  ?  ii  We  now  check  that  C.v)  also  satisfies  those  inequalities  indexed  by  nonnegative 
integers  ii.  n  ^  n  -  1 

If  ii  =  0.  there  is  nothing  to  check  for  there  are  no  such  n.  For  n  ^  I,  let  n  satisfy 
0  SJ  ii  ^  ii  ~  I  and  write 

„  tl 

B(x.a)  -  I  =  £(c,  -  c,  ,l(  v  -  /)  4  A  " 

/  n 
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where  A;-  £  (i,  i,  t)(v  -  /  ).  lor  each  integer  i.  //  +  I  ^  ^  n.  it  follows  that 

\  r  3?  0  anil  i,  i,  i  >  I)  as  before,  lienee.  A"  -j  0.  anil  hence. 

(4)  ;  -  I  Ss  iilx.  a )  -  I  ^  £<c,  -  c,  ,1  <x  -  /  ) 

r  II 

lor  each  integer  n,  0  ^  n  ^  it  I.  I  he  latter  finite  system  of  inequalities  MO)  together  with 
(2)  and  (3)  show  that  (e.x)  satisfies  (1).  implying  A  C  /,  and  in  particular  /.  is  nonempty. 

The  other  inclusion  /.  C  A  is  trivial  because  any  (r.x)  in  L  satisfies  in  particular 

--  -  1  ^  L(i,  -  i,  |Mv  -  r). 

r  n 

l  smg  (16)  and  1171  again  shows;  Si  IHx, a),  i.e.,  (;.x)  €  A, 

COROLLARY  I.  Let  x  be  nonnegative  real.  Then  ( Blx.a ),  x)  satisfies  each  inequality 
of  (1)  strictly  except  for  (i),  the  inequality  indexed  by  lie],  which  it  satisfies  as  an  equality,  and 
tii)  possibly  ihe  inequality  indexed  by  lx]  -  1  when  lx)  ^  1.  The  inequality  lx]  -  1  is 

satisfied  as  an  equality  if  and  only  if  x  is  a  positive  integer. 

PROOF:  Let  ;  =  Blx.a).  Application  of  (3)  shows  that  <r..v)  satisfies  each  inequality 
indexed  by  n.  n  >  h  +  I.  strictly,  where  n  =  lx).  By  (16)  and  (17)  of  Section  5.1.  n  follows 
that  l;..v>  satisfies  the  inequality  determined  by  «  as  an  equality  . 

It  only  remains  to  prove  that  the  inequalities  indexed  by  nonnegative  integers 
it,  ii  ^  h  -  2  are  satisfied  strictly.  There  is  nothing  to  check  if  i'i  ^  1.  For  Ti  >  2.  let  n  be 
any  integer  0  <  n  ^  >i  -  2.  Then 

Blx.a)  -  1  =  £  (cr  -  c,  ,)  (x  -  r)  +  [A  +  (c-  -  e„  ,)  (x  -  it)] 

r-0 


I 

where  .4  =  £  (c,  —  c, 

r  -  -  I 


?  0  and  4  >  0.  Hence, 


<x  —  r).  Since  n  ^  x  ^  n  +  I,  it  follows  that  (c-  -  c„  ,)(x  -  n ) 


B  ( x, a  )  -  1  >  J7(i,  —  c, 

.  ii 


,)  (x  -  r) 


for  each  integer  n.  0  ^  //  <  ii  ^  2. 

The  last  assertion  follows  from  -'\amining 

.  n  I 

Blx.a)  ■  I  =  £<c,  -  i,  |>  Cv  -  r)  +  (c„  —  cn  |)  (x  —  ii) 

t  a 

where  ii  =  lx  I  >  I.  for  the  inequality  indexed  by  n  -  I  is  satisfied  as  an  equality  if  and  only  if 
x  -  n  =  0. 


It  will  be  useful  later  to  include  upper  bounds  on  the  x-variables  in  the  set  K.  The  follow¬ 
ing  corollary  states  that  in  this  case  one  only  needs  a  finite  number  of  the  inequalities  of  ( ll . 

COROLLARY  2  Let  I  be  a  positive  integer  and  set  A' =  A  Pi  j(;.x)|()$  v  ^  I  |.  Let 
I.  be  the  set  of  all  (;,  v)  which  satisfy 


i 
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I-.I  rill  I  II  mu  A  / 

|‘l<(  ti  i|  1  ;  i \  I  I)  ■  v  1  ,  Ilk’ll  In  I’.oposilioo  !  .  A  I  ’since  / 

iiu  ■  ■:  |'"i  -  ilk  M.  I1U  mimue  system  (lk  it  lollops  immediately  thit  /  •  ( ,n  'll'-  <>ihcr 

■,  ,n  |  ;ct  r  \  i  lx  ,i  hit  t  ,ii  \  it  i/  I  hen.  ^  \?  1  and  //  C  I  .  whcic  >i  Ni  II  *  I. 

i!  membership m  /  implies 

I  X  'l  *  1  i  >  /  1 

I  n;i;j.  ti’’  loilowcd  In  1 1  h '  we  liiul  that  :  >  H'\.  u  1  implying  a  i  (  A 
On  .he  n.lici  'i.tnd.  if  «  -  f.  then  necessarily  -  l  Moreovu. 

i  -a  T  ltf  ~~  i,  1 1  I  f  -  / ) .  ( 

l>t: i  ’.lie  iig:.!  hand  sum  equals  by  1 171, 

I  I)  t  I  I  ,i/ )  1  +■  1  W  (  l  -  l.i/)  1 1  1 1 H  i  l  a/  ) 

v,  huh  is  merely  I  •  tfil.m.  1  herefore.  m  this  case 
•/Ml  .i/i 

.,iKi  *:,  \  i  •  K  .lis'i  I  Inis,  in  either  ease  •  j.a  I  e  A  ninth  implies  C..\  /  saiislies  the  entire  me- 
i ; e  v. xte-!-'  <  I '  b\  Proposition  1  lienee.  *r. \)  £  /  anil  lienee  /  s  /  Inereloie. 

/  /  A  Ilk  ll  \  l.’ids  A  I 

(  (  iRi  >1.1  \Kt  a  (.-.  i  i  is  an  extreme  point  of  A  if  ami  only  it  >  ^  a  iV'nneeatise  integer 
and  -  H  ■  ' 

PROOI  There  are  only  two  \ariables  rand  \  in  the  linear  inequality  system  tie  lienee 
extreme  points  ean  only  occur  on  the  boundary  of  A  at  the  intersection  ol  a  pair  ol  linearly 
independent  equations  By  Corollary  1  a  pair  of  linearly  independent  equations  arise  it  and  only 
It  \  is  a  nutmeg. dive  integer,  and  moreover,  each  nonnegalive  inleget  does  satis! \  two  tadja- 
vent1  lineal  I',  independent  equations.  I  Ins  includes  the  special  eases  of  the  endpoints  where  lor 
•I  io.  i  iic  additional  meuiiality  \  ^  U  is  used  and  at  I H  <  I .//  >.  1  >  the  ineqtialitv  \  s'  I  is  used 

* 1 1  )|*t  is,  1  h  iN  t  i kIci  the  bounded  variable  assumptions  made  m  beuioii  a  ' 

1 :  ,u : .  ipt i  mai  m  lu lion  1 1  \/  )  ,.S*.  •  e/  1 1  ol  L Pud  is  feasible  lor  LP  .  and 

til)  i‘  t'iv  each  high  isiee  gtoup  I 
•  I  i  •  >  /  *  a  *  .  d  / 

'hen  ;hts  i  nlinial  soinii  n  a  also  optimal  lor  program  I  P'  Moreover  there  *■>  :sl  <>  e  and  an 
ol  .lion  I  I  lJ  siieh  llial  with  respect  to  v  ,*  of  that  solution.  <2!>  holds 

PROOi-  h  n  id)  is  sirielh  satisfied  lor  anv  /  and  high  usage  i •.Pilin'  mm  *  '.ben  may  he 
.'■■el -is,  I  i  i  os  owe:  hound  without  aMecting  teavinluy  Thus  !( v,  >.v*  i  optimal  tor 
;  | ■ ,,  ■  ,  H  *  ,  i  a  i  loi  each  high  usage  calling  pan  :  By  (  oioli.u.  I  tm  each 
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high  usage  calling  pair  /  satisfies  (20)  lot  every  nonnegative  integer  Since  r'  ^  e,  .m  l 

i  |9t->  am!  1 1 I  a i e  alieaily  satistieil  follows  iluit  ;<  (;.'*>!  satisfies  all  i he  eonstrainis  <•; 

l  I’-.  This  proves  ( i ) 

I  he  liisi  pari  of  (ni  follows  Ironi  linear  programming  duality  theory  Because  ol  '21  i  llie 
two  dual  vaii.ihies  stemming  respectively  from  the  iwo  hounding  constraints  on  x,  are  both 
zero  Heine,  one  may  delete  these  constraints  in  I  I’m,  and  the  same  dual  optimal  solution  pre¬ 
vails  rhcrefotc.  by  duality  (e  )]  is  optimal  lot  the  relaxed-vart.thlc  constrained  pro 

gram  LP'  I  he  icmaimng  .statement  of  part  (ii)  follows  from  Corollary  I  and  the  fact  that  the 
numicgaiiv  v  imegci  s  n  ,  ,/J ,  satisfy /d ,  I  v/  2  2 
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ABSTRACT 

A  system  deteriorates  due  lo  shocks  received  at  random  times,  each  shock 
causing  a  random  amount  of  damage  which  accumulates  user  time  and  may 
result  in  a  system  failure  Replacement  of  a  failed  system  is  mandatory,  while 
an  operable  one  may  also  he  replaced  In  addition,  the  shock  process  causing 
system  deterioration  may  he  controlled  by  continuous  preventive  maintenance 
expenditures  The  joint  problem  of  optimal  maintenance  and  replacement  is 
analyzed  and  it  is  shown  that,  under  reasonable  conditions,  optimal  mainte¬ 
nance  rate  is  decreasing  in  the  cumulative  damage  level  and  that  beyond  a  cer¬ 
tain  critical  level  the  system  should  he  replaced  Meaningful  bounds  are  esta¬ 
blished  on  the  optimal  policies  and  an  illustrative  example  is  provided 


1.  INTRODUCTION 

In  this  section,  we  first  introduce  the  reliability  problem  treated  in  this  paper,  provide  a 
background  in  terms  of  the  relevant  literature,  and  summarize  our  assumptions  and  results 

A.  Problem  Statement 

Consider  a  system  that  receives  shocks  at  random  points  in  time,  each  shock  causing  a 
random  amount  of  damage  which  accumulates  over  time.  As  the  cumulative  damage  level 
increases,  the  rate  at  which  the  system  generates  revenue  declines  and  the  probability  of  its 
failure  increases.  Replacement  of  a  failed  system  is  considered  mandatory,  while  an  operable 
one  may  also  be  replaced,  a  forced  replacement  being  costlier  than  a  voluntary  one.  In  addition 
to  replacement,  the  damage  process  can  also  be  influenced  by  preventive  maintenance  expendi¬ 
tures;  higher  expenditure  rates  buffer  the  system  more  effectively,  and  hence  decrease  probabil¬ 
istically  the  frequency  of  occurrence  of  shocks  as  well  as  their  severity.  Our  problem  is  lo 
determine  an  optimal  policy  that  specifies  a  sequence  of  replacement  and  maintenance  expendi¬ 
ture  schedules  so  as  to  maximize  the  expected  discounted  net  profit  generated  by  the  system 
over  an  infinite  planning  horizon. 
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K.  Background  Literature 

The  extensive  literature  on  control  of  stochastically  deteriorating  systems  has  been  sur- 
veyed  by  McCall  1 1 0|  and  Pierskalla  and  Voelker  illi.  Of  particular  relevance  here  is  the  work 
by  Taylor  1 1 5 1 .  I'eldman  |b,7|  and  Abdel-Hameed  and  Shimi  ll]  on  optimal  replacement  of  a 
system  that  is  subject  to  shocks  and  failure,  based  on  the  theory  of  optimal  stopping  in  Markov 
processes  On  the  other  hand,  Thompson  |lh|  and  Kaniien  and  Schwartz  [8|  employ  optimal 
control  theory  to  characterize  the  time  pattern  of  optimal  maintenance  expenditures  that  retard 
the  system  failure  rate  In  the  former  class  of  models  the  only  decisions  available  tire  whether 
to  replace  the  system  or  not.  while  in  the  hitter  class  the  state  of  the  system  at  any  time  is 
described  as  being  either  working  or  failed  Our  model  incorporates  the  essential  features  ol 
both  of  these  two  classes  in  that  it  allows  for  varying  degrees  of  preventive  maintenance  (in 
addition  to  the  replacement  action )  as  well  as  a  more  detailed  description  of  system  deteriora¬ 
tion  (in  addition  to  its  description  as  working  or  failed).  Our  analysis  is  based  on  the  methodol¬ 
ogy  of  stochastic  dy  namic  programming,  as  in  Herman  |5l.  Ross  [1.1]  and  others.  Some  prelim¬ 
inary  work  along  these  lines  may  he  found  in  C'hikte  (31  and  Chikte  and  Kozin  [4 1 

(  .  Overview  of  Assumptions  and  Results 

In  Section  2,  we  define  the  state  of  the  system  in  terms  of  its  cumulative  damage  level, 
which  increases  randomly  due  to  occurrence  of  shocks  and  is  influenced  by  continuous  mainte¬ 
nance  expenditure  and  instantaneous  replacement  actions  The  probabilistic  rate  at  which  dam¬ 
age  accumulates  is  assumed  to  be  decreasing  in  the  maintenance  expenditure  rate  (  Assumption 
/’(ill  t  pon  receiving  a  shock,  the  system  may  fail  instantaneously  with  a  probability  that  is 
assumed  to  he  increasing  in  the  resulting  damage  level  but  at  a  diminishing  rate  (Assumption 
/'(;/((  II  the  svslent  does  tail,  it  must  he  replaced  instantaneously  by  a  new  one  at  a  fixed 
cost,  while,  even  it  it  does  not  tail,  it  may  still  be  replaced  voluntarily  at  a  lower  cost  (.Assump¬ 
tion  /  i  ii  1 1  \n  operating  system  continuously  generates  revenue  at  a  rate  which  decreases,  but 
at  a  diminishing  rate,  as  the  cumulative  damage  level  builds  up  (Assumption  /.(/)).  finally, 
we  also  iniioduce  a  condition  (  Assumption  R •  which  ensures  a  profitable  system  operation  (as 
in  I  hci  >tcm  4  i 

In  Section  v  we  first  show  that  the  maximum  infinite  horizon  expected  discounted  net 
ptolit  liont  svstem  operation  decreases  at  a  diminishing  rale  .is  the  cumulative  damage  level 
nutcases  (theorem  l>  We  then  show  that  it  is  optimal  to  replace  the  system  voluntarily  as 
soon  as  its  cumulative  damage  level  exceeds  a  critical  threshold  (Theorem  2)  As  to  the 
optimal  preventive  maintenance  policy,  we  show  that  the  maintenance  expenditure  rote  should 
be  reduced  as  the  cumulative  damage  level  builds  up  to  the  critical  value  (  Theorem  4)  finally, 
we  derive  *m  I  heorem  M  meaningful  bounds  on  the  optimal  policy  In  particular,  we  show 
that  postponement  ol  voluntary  replacement  cannot  be  optimal  if  the  extra  profit  from  system 
operation  until  the  next  shock  cannot  lustily  the  extra  cost  due  to  a  possible  failure  and  replace¬ 
ment  at  that  shock  We  also  show  that  the  optimal  maintenance  expenditure  rate  is  always 
strictlv  less  than  the  rate  at  which  the  system  currently  generates  revenue 

Section  4  provides  an  example  that  illustrates  the  model  and  the  results  and  Section  5 
concludes  the  paper  with  some  remarks  on  the  type  of  information  required  for  implementa¬ 
tion 

2.  MODI'. I  IORMI  I  ATION 

In  this  section,  we  lirst  establish  the  notation  and  define  the  basic  components  of  our 
model,  then  we  present  the  assumptions  made  and  finally  we  describe  the  overall  model 
dy  nannes 
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A.  Notation  and  Definitions 

l  et  .1  nonnegative  i >it.  variable  \  denote  the  cumulative  damage  level  of  the  s\ stem 
in  operation  at  time  t  '  h  it  is  the  sum  total  of  the  damages  suffered  due  ,.i  shocks  received 
by  the  system  b\  time  / 

I  he  damage  process  eliciting  \  is  controlled  by  means  ol  a  continuous  prev  ■  1 1 1 s e 
maintenance  expenditure  rate  in  e  |t>. A/ 1 .  where  < the  budget)  \l  <  Maintenance  i-  .>  med 
at  protecting  the  system  from  the  undesirable  environment  so  as  to  retard  the  rale  ai  uhu.li 
shocks  are  received  and  to  dampen  the  magnitudes  of  damages  inflicted  by  them  I  el 
Mint  >  I)  be  the  probabilistic  rate  .it  which  shocks  occur  if  the  maintenance  rate  is  in  I  bus.  il 
the  maintenance  rate  is  a  constant  in  thiough  time,  the  time  interval  between  successive  shocks 
is  exponentially  distributed  with  the  mean  1/A  (mb  l  et  a  nonnegative  random  variable  )  denote 
the  magnitude  of  damage  caused  by  a  shock  and  let  <7(-  I  ml  be  the  cumulative  distribution 
function  ol  ).  parametrized  by  the  maintenance  rate  in.  Thus.  A  <  /// )  1 1  -  (Hilwll  is  the  pro¬ 
babilistic  rate  at  which  shocks  causing  damage  m  excess  of  i  occur  if  the  maintenance  expendi¬ 
ture  late  is  m 

II  X  =  \  is  the  damage  level  tust  prior  to  time  t  and  if  the  system  receives  a  shock  at 
time  i.  causing  an  additional  damage  of  magnitude  t  Iso  that  A,  =  r  =  a  t  it.  then  the  s'  stent 
may  tail  instantaneously  with  a  probability  denoted  by /dr),  depending  on  the  new  cumulative 
damage  level  r.  while  with  probability  II  -  /><r)|  it  endures  the  shock  and  continues  to  operate 
If  the  system  fails  at  time  /.  it  must  he  replaced  immediately  by  a  new  one  at  the  forced 
replacement  cost  ( A  >  (I  fven  if  the  system  survives  the  shock,  it  may  still  be  replaced 
instantaneously  at  a  voluntary  replacement  cost  (  ,  >  0,  In  either  case,  the  replacement  deci¬ 
sion  at  time  t  will  be  denoted  as  </,.  =  I.  while  </,  =  ft  corresponds  to  the  nonreplacemem  deci¬ 
sion. 


It  A.  =  .v  j?  0.  let  Hv)  ^  (I  denote  the  instantaneous  rate  at  which  the  system  generates 
revenue  from  its  operation  Suppose  that  future  revenues  and  costs  are  discounted  column 
ously  at  rate  n  >  ft,  so  that  c  is  the  present  value  of  one  dollar  earned  t  time  units  Mom 
now 


By  a  (replacement  and  maintenance)  policy  b  we  mean  a  pair  hS!.z>  i  ol  functions  of  me 
system  state  v.  denoted  as  b.-|0.  *•>  —  jo.  ||  and  by  Ift.^o)  -  |ft.  l/|  Here,  the  replacentent 
rule  b,  specifies  leplacement  ol  the  system  in  state  \  (which  is  mandatory  il  the  system  is 
down)  if  ft.l.xl  =  I.  while  6 ,  ( x  i  -  ft  specifies  the  nonreplacement  decision.  Similarly,  il  me 
system  is  in  state  the  maintenance  iuIc  ft-  specifies  a  maintenance  expenditure  rale  n  i  v  i  • 
[ft.  1/1  In  light  ot  the  results  bv  Stone  1 1 4 1  and  I’liska  1 1  21  on  controlled  tump  pioccssis.  a  i, 
reasonable  to  stipulate  that  o  icviscs  the  teplacement  and  maintenance  decisions  only  . 

times,  tlepemling  on  the  state  ol  the  system  then 

I  inally.  let  I  1  denote  the  net  expected  discounted  return  from  employing  the  pom  . 
over  an  infinite  planning  horizon,  starting  with  a  system  m  stale  v  ’ft  I  et  I  '  v  1  -up 

l  „ ( v  )  be  the  ni.iximum  possible  return  obtainable  A  policy  b'  is  said  to  be  optimal  d  t  !  .  ' 
=  fly)  for  all  x  -J  ft  In  order  to  characterize  the  optimal  return  function  I  and  the  opt  mil 
policy  b  *,  we  need  to  make  certain  assumptions  on  the  model  parameters 

B.  Assumptions 

Regarding  the  etlci  uvctiess  ol  pi  ventive  mainienaiice  expendiiuies  in  d.unpi  inng  o 
’lock  process,  we  assume  that  higher  expenditure  tales  m  pmtecl  the  system  Imite  .eul  as  s,u 
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result  m  lower  probabilistic  rates  A  ( m  !  ll  (>'<y  IwM  at  which  additional  damages  in  excess  of 
any  given  c|uanntv  t  occur  llhis  is  analogous  to  the  stochastic  monolonicily  assumption,  as, 
for  example,  m  Dorman  i 5 1  I  As  to  the  system  failure  process,  it  is  reasonable  to  suppose  that 
the  system  may  tail  only  at  shock  times  and  that  the  probability  />(:)  of  its  failure  increases  in 
the  resulting  cumulative  damage  level  ~  but  only  at  a  decreasing  rate  We  slate  these  probabilis¬ 
tic  assumptions  as 

Assumption  /’ 

(i)  for  any  fixed  i  ^  I),  A  hnl  ll  -  G(i|/n)|  is  continuous  and  nonincreasing  in 
in  i  Id.  A/ 1  In  particular,  taking  i  =  0.  A  (in)  is  continuous  and  nonincreasing  in 
m  f  |0. A/ ] 

(ii)  The  failure  probability  />fel  is  nondecreasing  and  concave  in  the  cumulative  dam¬ 
age  level  0* 

With  respect  to  the  economies  of  the  system  operation,  we  assume  that  an  operating  sys¬ 
tem  in  state  v  -s  d  generates  revenue  at  rate  r l.v )  which  is  nonincreasing  and  convex  in  the 
cumulative  damage  level.  This  reflects  a  degradation  in  the  system  performance  as  the  damage 
accumulates  hut  at  a  diminishing  marginal  rate.  On  the  replacement  cost  side,  we  assume  that 
the  cost  (  •  of  replacing  a  failed  sy  stem  is  higher  than  the  cost  C,  of  a  voluntary  replacement  of 
a  working  system  (possibly  due  to  the  salvage  value  differential),  thereby  providing  an  incentive 
to  replace  the  system  before  failure.  Also,  to  make  this  system  operation  and  replacement  a 
worthwhile  undertaking,  it  is  essential  that  the  cost  C,  of  a  voluntary  replacement  be  compen¬ 
sated  for  by  the  present  value  /  (())/<»  of  the  infinite  horizon  revenue  that  a  system  maintained 
in  mint  condition  would  generate  We  summarize  these  economic  conditions  as 

I ssumptton  /.. 

( i )  The  revenue  rate  r  ( v )  is  nonnegative,  bounded,  noninereasing  and  convex  in  the 
damage  level  v  >  0  and  i  (())/«  > 

<ii)  The  replacement  costs  C|  anil  C\  satisfy  C>  >  C,  >  0. 

I  he  above  assumptions.  P  and  will  be  used  to  characterize  properties  of  the  optimal 
value  function  (  and  the  maintenance  and  replacement  rules  (in  Theorems  1,2  and  3),  while  to 
show  that  f  is  positive  (in  Theorem  41  and  to  provide  bounds  on  optimal  policies  (in  Theorem 
si  we  impose  the  following  simple  and  easily  verifiable  condition  on  the  problem  parameters, 
which  ensures  that  the  overall  operation  of  (he  system  is  a  profitable  one. 

Assumption  R.  There  exists  an  m'  €  [O.A/I  such  that  m *  ^  lim  z(.v)  and 
|r  (0)  -  III  *1/  |(t  +  A  ( ill  * )  |  ^  (A. 

It  says  that  the  net  expected  discounted  profit  generated  by  a  new  system  that  is  main¬ 
tained  at  a  small  enough  expenditure  rate  m*  until  the  next  shock  makes  up  for  the  cost  of  a 
failure  replacement  that  might  be  necessary.  Generally  speaking,  if  the  revenues  generated  by 
operating  the  system  are  "high"  enough  in  relation  to  the  replacement  costs,  if  the  shock 


’ll  is  possible  lo  icl.tx  Assumption  /'•//»  h\  requiring  ih.it  >  be  uhk.ivc  onl\  on  the  region  ol  v.ilues  ol  j  on  which 
/» ( .■  *  i>  not  cqu.il  to  one 
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process  is  sufficiently  "slow"  and  "mild”  and  if  the  failure  probability  is  "small"  enough,  then  it  is 
possible  to  make  the  system  operation  a  profitable  one;  Assumption  R  constitutes  one  particular 
set  of  such  conditions. 

C.  The  Model  Dynamics 

finder  a  policy  8  =  (fii.fid,  the  cumulative  damage  process  ! X,.t  ^  0)  evolves  as  a  non¬ 
terminating  pure  jump  process  as  follows.  If  the  state  of  the  system  in  operation  at  time  i  is 
X,  =  v  and  if  the  replacement  rule  specifies  6t<  \  )  =  I  then  the  system  is  renewed  instantane¬ 
ously,  yielding  X,.  =  0  at  a  voluntary  replacement  cost  C|,  while  =  0  leaves  the  system 
state  unchanged  until  the  next  shock.  Given  A,,  =  v  >  0,  the  maintenance  policy  specifics  a 
continuous  maintenance  expenditure  rate  8:(.v)  €  l<).  f/1.  Then  the  sojourn  time  S  in  state  x  is 
exponentially  distributed  with  parameter  AfSjf.vD.  During  this  interval,  Ihe  system  generates 
revenue  at  rate  r(.v)  and  is  maintained  at  an  expenditure  rate  8_>(\ )  and  thus  yields 

A  (w)e 

—  (rf.x )  —  8;(.v)]/I«  +  A(Sif.v))] 

as  the  expected  discounted  profit  until  the  next  shock.  Similarly,  the  net  return  from  the  next 
shock  onwards  will  be  discounted  by  the  expected  discount  factor 

J  c  A  (8  ,(x  ) )  e  AlS-'  ls  ds  =  A  (8  ,(x ) )/(«  +  A  (S  -d.v))]. 

The  next  shock  causes  damage  of  magnitude  y  according  to  the  distribution  G(dy\m),  so 
that  the  postshock  state  is  X,  .  ,  =  v  T  v.  At  that  instant  the  system  fails  with  probability 
p  f.v  +  v  ),  in  which  case  it  must  be  replaced  (i.e.,  5,(x  +>)=  1)  at  cost  C  2,  so  that 

V .  =  0  If  the  system  does  not  fail,  which  happens  with  probability  [1  —  p(x  +  y)],  and  if 

8  |  ( v  +  v )  =  1 ,  the  system  is  replaced  at  cost  and  X,,  tu  +  =  0,  while  if  8 ,  (x  +  y)  —  0  then 
the  system  continues  to  operate  in  slate  X,,  ,,M  =  x  +  >•.  In  any  case.  8,(A',,V>I  J  is  the  expen¬ 
diture  rate  at  which  the  system  is  maintained  until  the  following  shock,  and  the  process  repeats. 

Our  objective  is  to  investigate  an  optimal  decision  rule  8*  =  (8  5  2*)  which  specifies  the 
replacement  and  maintenance  decisions  8  ,’(x )  and82(x)  as  functions  of  the  cumulative  damage 
level  x  at  each  shock  instant,  so  as  to  yield  the  maximum  expected  discounted  net  return 
U*(x>  =  f'(x)  for  each  x  >  0.  In  the  next  section,  we  analyze  this  problem  in  the  stochastic 
dynamic  programming  framework. 

3.  OPTIMAL  RETURN,  REPLACEMENT  AND  MAINTENANCE 

In  subsection  A  below,  we  first  provide  the  dynamic  programming  functional  equation 
satisfied  by  the  optimal  return  function  f'(x),  which  is  then  shown  to  possess,  under  Assump¬ 
tions  P  and  K,  certain  "nice"  properties.  In  subsection  B,  we  make  use  of  these  properties  of  V 
to  characterize  the  structure  of  optimal  rules  8 and  8  ,’,  while  in  subsection  C.  Assumption  R  is 
employed  to  derive  interesting  bounds  on  8  and  8 

A.  The  Optimal  Return  Function 

In  order  to  analyze  the  optimal  return  Ulx),  we  first  define  E„(.v)  as  the  maximum 
expected  discounted  profit  over  an  infinite  time  horizon,  starting  with  a  system  in  state  v  and 
given  that  exactly  n  more  shocks  will  eventually  occur.  This  is  analogous  to  the  approach  in 
Lippman  [ 8 1  and  enables  us  to  interpret  n  as  the  time  index,  yielding  the  Bellman  dynamic  pro¬ 
gramming  recursion  in  a  discrete  time  format  as  follows. 
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X,  X,  * . |Hv)  - w|f/" 
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;,K  s  |)  (  ink  1 1  \\l  ■  s  p  ni  sliMl  kll 

I  or  .ill  n  =  1 ,  2 . anil  x  2  0, 

Ml  l.  lxi-  Max  |  l  ,  (01  -  U„  (x  )|. 

where 

1 4 1  (  ( x  I  -  Max  /,,,  I  ,  I  x  l . 

•ii  ■  i.i  ui 

and  (ho  operator  /„,  is  defined  hi 

(>>  /„  1  ,  ;(X  )  -  |/  (X  )  ~  111  +•  f 

+  l‘„ 


I  r„  i  <  x  +  r  •  1 1  -  p  (  x  +  v  )| 

i  (01  -  (';!/>  (.v  +  i)|\  <m  I  (7  (i/v \m  )\/[a  +\(m)] 


and 

(h)  I  ,.( x )  =  Max(r  (())/<*  -  C,.  Hx)/o}. 

These  equations  may  be  interpreted  as  follows.  It  the  system  in  stale  x  facing  n  more  shocks  is 
replaced  voluntarily,  the  net  optimal  return  would  be  f„  (0)  —  C'(.  since  n  more  shocks  still 
remain.  On  the  other  hand,  maintaining  it  at  rate  m  yields  |r(x)  —  ml/ [<»  +K  (m  )|  as  the 
expected  discounted  profit  until  the  next  shock,  according  to  (I).  If  the  next  shock  is  of  mag¬ 
nitude  i  (determined  according  to  (Hdx !w)>.  the  optimal  return  from  then  on  is  V„  ,  (x  +  y) 
provided  the  system  survives  the  shock  li  e.,  with  probability  [1  -  p(x  +  ,y)l)  and 
l,  |tm  -  (  otherwise,  discounted  by  the  expected  discount  factor  \(m)/[u  +  Aim)],  as  in 
>2>  finally,  with  no  more  threat  of  future  shocks  (i.e.,  n  =  0),  mainlenance  expenditures  are 
unnecessary  and  we  may  or  may  not  replace  the  system,  which  will  be  operated  from  then 
onwards  without  further  deterioration. 

LEMMA  1  Under  Assumptions  P  and  F.  for  each  n  —  0.1.2 . the  functions  U„(x) 

and  L’,(x)  are  bounded,  nonincreasing  and  convex  in  x  >  0. 

PROOP:  Boundedness  follows  from 

(7)  r  (())/<*  ^  l„(x)  7?  -  \\1  +  (  :[a  +  A(.W)ll/«.  , 

since  /  (()>  is  the  highest  rate  of  return  obtainable,  while,  in  the  worst  case,  infinitely  many 
shocks  occur  and  each  requires  a  failure  replacement  in  spite  of  employing  the  maximum  possi¬ 
ble  maintenance  rate  To  prove  monotonicity  and  convexity  of  by  induction  on  n,  note 
from  (6)  and  Assumption  F  (i)  that  l  „  has  these  properties.  Suppose  that  V„ .  i  is  nonincreas¬ 
ing  and  convex.  From  ( 3 >  and  Assumption  F  (ii)  we  have  V„  \  (x  +  y)  7?  K„_|  (0)  —  C2. 
Using  this,  together  with  the  induction  hypothesis  and  Assumption  P,  it  can  be  checked  that, 
for  each  v,  the  integrand  in  (5>  is  nonincreasing  and  convex  in  x.  This,  together  with  Assump¬ 
tion  I  (i)  yields  monotonicity  and  convexity  of  Tm  V„  ,  for  each  m.  Since  these  properties  are 
preserved  under  the  maximization  operation,  we  have  U„  and  hence  nonincreasing  and  con¬ 
vex 

Q  ED 

from  the  definition  of  l„,  it  is  easy  to  see  that  V„  <  V„  |  for  all  n  —  1,2 . i.e.,  per¬ 

mitting  more  shocks  can  not  improve  the  total  return  obtainable.  Thus,  the  sequence  of  func¬ 
tions  1  l  „,  n  =  0,  1 , 2 . 1  is  bounded  as  in  (7)  and  nonincreasing,  so  that  V—  lim  V„  exists 

and  is  the  maximum  net  expected  discounted  return  over  an  infinite  horizon,  given  that  an 
unlimited  number  of  shocks  will  eventually  occur.  By  standard  contraction  mapping  arguments, 
T  is  the  unique  solution  to  the  following  functional  equation,  which  is  similar  to  (3)  and  (4). 

(SI  l  (x  l  =  Maxi  f  (0)  -  C,.  U(x) |.  x  >  0 


where 
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(9)  U(x)  =  Max  7„,  Ha) 

m  »  |l).  \/l 

and  T,„  is  the  operator  defined  in  (5). 

Sinee  the  properties  of  U„  and  V„  is  Lemma  1  are  preserved  upon  taking  limits  as  /;—<». 
we  have  the  following. 

THEOREM  I:  Under  Assumptions  P  and  E.  the  optimal  value  funetions  U(x)  and  Ha) 
are  bounded,  nonincreasing  and  convex  in  v  >  0. 

B.  Optimal  Replacement  and  Maintenance  Policy 

From  (8),  it  is  clear  that  the  optimal  replacement  rule  8*  specifies  the  replacement  deci¬ 
sion  8,  (a)  =  I  in  state  ,v  if  and  only  if  Ha)  =  HO)  -  Similarly,  the  optimal  mainte¬ 
nance  rule  8/  specifies  in  state  .v,  the  smallest  expenditure  rate  82*(a)  which  attains  the  max¬ 
imum  of  rml  (.v)  in  (9)  over  m  €  (0,,V/1;  our  continuity  and  compactness  assumptions  assure 
the  existence  of  this  maximizer. 

We  first  show  that,  in  our  model,  the  optimal  replacement  rule  8*  has  the  well  known 
control  limit  form  (as  in  the  models  of  Derman  [S],  Ross  [13],  Taylor  [15],  Feldman  [6,7]  and 
others) 

THEOREM  2:  Under  Assumptions  P  and  E,  there  exists  an  x*  €  [0,°°]  such  that  8,‘  (a) 
=  I  if  and  only  if  a  ^  a*. 

PROOF:  By  monotonicity  of  U  and  definition  of  8*,  we  may  define 

(10)  ,v*=  inf  (a  ^  0:H0)  -  C,  >  U(x)). 

Q.E.D 


Next,  we  show  that  the  optimal  preventive  maintenance  expenditure  rate  is  nonincreasing 
in  the  damage  level  of  the  system.  This  may  be  viewed  as  a  stochastic  analog  of  the  result  of 
Kamien  and  Schwartz  [8]  and  Thompson  [14],  wherein  the  optimal  maintenance  rate  is  shown 
to  be  decreasing  in  the  chronological  age  of  the  system.  Indeed,  it  is  reasonable  to  expect  a 
reduction  in  continuous  maintenance  as  instantaneous  replacement  becomes  more  imminent. 

THEOREM  3:  Under  Assumptions  P  and  E,  the  optimal  maintenance  rate  82’(a)  is 
nonincreasing  in  a€  (0,a‘),  where  a*  is  given  by  (10). 

PROOF:  If  a  <  a*,  then  from  (8)  and  Theorem  1  we  have  E(x)  =  U(x),  which  can  be 
seen  to  be  equivalent  to 

(11)  <*V(x)=  Max  [r(x)  —  m  —  J'(x,m )] 

me  lowi 

where 

(12)  f(x,m)  =  f  [[F Ta)  —  ,P(a+v)] 

j  0 

+  [y(x+y)  -  E(0)  +  c2l  />(a+v))  k(m)G (dy\m). 

Take  A|  <  a2  <  a*,  so  that  we  need  to  show  that  f>‘  (aj)  ^  82*  (a2).  We  first  show  that 
|./'(a2,«7)  —  /(A|,w)]  is  nondecreasing  in  m  6  [0, M\.  Now 

03)  [/(x2,m)  —  /(x|,m)]  =  J  a(a|,a2>>' )A(w)  (Hdv\m) 
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where 
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(14)  .el.xq.xN.y )  =  l>  <v2)  -  (  <a,4i  )|  -  |1  (.V, )  -  n.v,+.»’)| 

+  [I  (a24v)  —  1  (())  4-  (  2|  pix2  4  l  ) 

—  1 1  (_V|  4  r )  —  I'  (0)  4  (  2)  /> (.v |  4  i  ) 

=  (I  (.V;)  -  1  ( .V  | )  1  4  [(  (  V|4r  )  —  I  (.Vj4y)|  [1  —  p  (A|  a  v )  I 

4  [l  (,XN4y)  —  (  (())  4  (  i|  [p  (an4  r  )  —  p  (a'|4v  ) ) . 

By  monotonicity  and  convexity  of  l  (Theorem  1)  and  monotonicity  of  p  (Assumption  P  (ii)), 
the  second  term  in  the  above  expression  is  nonincreasing  in  y.  Also  monotonicity  of  F,  con¬ 
cavity  of  p  and  the  inequality  F(x)  -  (  (0)  4  C2>  F(.x)  —  F(0)  4  (',  >  0  (since  C2  >  C: 

and  (  satisfies  (8))  imply  that  the  third  term  is  nonincreasing  in  >\  Thus,  y<Xj.x2 ,y)  is  nonin¬ 

creasing  in  y.  This,  coupled  with  Assumption  P(i)  now  implies  that  [/(x2,m)  —  f(x\,m)\  is 
nondecreasing  in  [0,.V/|.  Since  82*(a)  attains  the  maximum  on  the  right  hand  side  of  (11), 
the  above  implies  that  82*(a'|)  ^  82’( an),  whenever  an  >  X|,  because  otherwise  we  would  have 


|r(v:)  -  h* (an)  -  /(an,8,*(an))| 
<  (/-(an)  -  8i"(a'i )  -  ./<an.82*(x,))| 
i.e. 


\r  (a  | )  -  82‘(a'2)  -  /(.V|,8 2* (-X 2 ) )  1 
\r  (a  |  >  -  8  2*  (  a  ] )  -  /(.Y],82*(.V|))] 


lr (an)  -  S2*(an)  -  ./ (an.8  *(.v2))l  4  [/-(a,)  -  82*(A|)  /(A|,fi2‘(.V|))] 

<  [r  (an)  -  82*(a'|  )  -  ./  (  v>.82*(a  , ) )]  4  [r  (a,)  -  82*(an  -  /(a'|,82*(.v2))], 
contradicting  optimality  of  8  *(  v)  when  in  state  a. 


0  E  D. 


Thus,  by  Theorems  2  and  3,  the  higher  the  state  of  deterioration  of  the  system  the  less 
should  be  the  maintenance  effort  to  prevent  further  deterioration  and,  as  soon  as  the  deteriora¬ 
tion  level  exceeds  a  critical  value,  the  system  should  be  replaced  by  a  new  one. 

C.  Bounds  on  Optimal  Policy  and  Return 

So  far,  with  Assumptions  P  and  E,  there  is  no  guarantee  that  even  the  optimal  policy  will 
result  in  a  profitable  system  operation  over  the  longrun.  This  is  precisely  the  purpose  of 
Assumption  R,  as  the  following  Theorem  4,  shows  and  this  fact  will  also  be  needed  to  establish 
bounds  on  x*  and  8:*(  \ )  in  Theorem  5  below. 

THEOREM  4:  With  Assumptions  P,  E  and  R,  the  optimal  return  F(a)  is  positive  for  all 
a  ^  0. 

PROOF:  Consider  a  policy  8  =  (S|,82).  where  8|(.v)  =  1  and  82(x)  =  m*  for  all  x  ^  0, 
where  m *  is  as  in  Assumption  R;  thus  8  replaces  the  system  at  every  shock  and  always  specifies 
the  constant  maintenance  rate  m*.  The  total  expected  discounted  return,  starting  in  state  ,v  and 
following  this  policy  8,  is,  therefore 

Fft  (a  )  =  (r  (a  )-w*|/I«  4  A  <m*)l  4  A  (m*)  (r(0)-w*|/[a|«  4  A  ( w  * )  1 1 
—  K  ini')  I A  (a)  4  Kim*)  K  <0)/u]/[a  4  K  (m*)l. 


K( a)  =  C,  4  <(W, 


pix+y)Gidy\m*) 


where 
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is  the  expected  replacement  cost  upon  receiving  a  shock.  Since  C \  >  A(x)  >  A  (0).  we  have 
I*  Tv)  ^  [ /•  ( .v )  —  »?*|/|«  +  A(/«*)|  +  A(w*)/«{  +  A(w*)|— C2)  >  0. 

by  Assumption  R  Since  I(x)  ^  l  A (  v )  for  all  x,  the  proof  is  completed. 

Q  ED 


Our  final  objective  is  to  derive  bounds  on  the  optimal  policy  8  * 

THEOREM  5:  Under  Assumptions  P,  E  and  R,  we  have 

(15)  .v*  <  b. 
where  b  =  inf  B, 

B  =  {.x  ^  0;  Max  ||r(x)— m\/k(m)  -  iC'2~  Cj)f  p(x+v)Gidv\m)\  ^  0) 

mf  lO.A/l 

and 

(16)  8*(x)  <  r(x).  x€[0,x*). 


PROOF:  To  prove  (15),  in  view  of  (8),  it  suffices  to  show  that  F(x)  >  U( x)  whenever 
a  €  B.  Suppose  a  €  Band  Fix)  =  U(x).  Now 


U(x)  <  Max  !  Hx)-m  +  f°°  {  F(x)Il-/>(x  +  v>] 
meio.vi 


+  |F(0)  -  C'2\p (x  +  y)|A(/n)Cr(t/y  |m)}/[a  4-  A(m)) 
<  V (x)  Max  {A(m)/[«  +  A(w))) 

mi  I0..V/] 

+  Max  (r(x)  —  m  +  f  [-  T(x) 
mdO.MI  *3" 

+  V (0)  —  C2\p(x  +  y)  h  (m)G(dy\m)\l\ct  +  A (m )] 


<  F(x)  Max  (A(m)/[a  +  \(/n)]) 

mi  (0..V/I 


+  Max  \r(x)  -  m  -  f  (C2  _  C’i )/?  (  v  +  >)A  (m)G  (dv  I  w)(/[a  +  A  (m )] 
mi  o  wl 


mi  la., Ml 

^  V(x)  Max  A(m)/[c*  +  A(w)] 
mi  In.  vrl 

<  F(x), 


yielding  a  contradiction.  In  the  above  argument,  the  first  inequality  follows  from  F(x  +  >■) 
^  F(x)  (Theorem  1),  the  third  one  from  F(x)  >  F(0)  —  C|,  the  fourth  one  from  the  fact 
that  x€B  and  the  last  one  from  V(x)  >  0  (Theorem  4).  To  prove  (16)  by  contradiction,  sup¬ 
pose  that  8 2*  (x)  ^  r(x)  for  some  x€  (0,x*).  Then 

V  (x)  =  Uix) 

=  |r(x)  -  82*(x)  +  f°°  {  V  (x  +  v)ll  -  pLx  +  y)] 

J  0 

+  I  V  (0)  -  C21/7(x  +.y))A(S2'(x))G'(</v|82‘(x))}/[a  +  A  (82’(x))l 

^  A  (82*(x))/|«  +  A(82*(x))]  f"  \V(x  +y)|l-/>(x  +y)l 

v  0 

+  (F(0)  -  C2)/>(x  +  y))G(r/v|82*(x)) 
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^  X  <8  ,*(x  ) )/(«  +  A  (8  ,‘(x  ) )  |  (  V(x ) 

+  f  |f'(0)  -  C\-  l'(x)lp(x  +  v)G(Jv|8;(x))) 

•Ml  *■  " 

<  l(x)A  (ft  ‘(  v )  )/U*  +X(82*(x))1 

<  K.v), 

again  yielding  a  contradiction.  Here  the  first  two  equalities  follow  from  the  definitions  of  x* 
and8:\  respectively.  The  first  inequality  follows  from  the  hypothesis  that8*(x)  >  r(x),  the 
second  inequality  from  monotonicity  of  E,  the  third  one  from  x  <  x*  and  the  last  one  follows 
by  positivity  of  I 

Q.E.D. 

The  bound  in  (15)  may  be  interpreted  in  terms  of  a  "one-stage-look-ahead"  stopping  pol¬ 
icy,  as,  for  example,  in  Ross  (13,  p.  1831  Suppose  we  postpone  the  voluntary  replacement  of 
an  operating  system  until  the  next  shock  in  the  hope  of  "squeezing”  additional  revenue  out  of 
it  However,  such  a  postponement  would  involve  the  risk  of  a  higher  forced  replacement  cost 
due  to  possible  failure  the  next  shock  might  cause.  The  first  part  of  Theorem  5  in  essence  jux¬ 
taposes  these  two  conflicting  factors  in  specifying  an  optimal  replacement  strategy.  It  asserts 
that  if  the  net  expected  revenue  until  the  next  shock,  |/  (x)  —  m]/\im),  cannot  at  the  least 

s%  oo 

overcome  the  expected  extra  cost  <C2  -  C"( )  J0  pix  +  y)Gidv\m)  due  to  possible  failure 
replacement  at  the  next  shock,  for  any  choice  of  maintenance  rate  m,  then  it  is  best  to  replace 
the  system  right  away  instead  of  waiting.  The  second  part  of  the  theorem  says  that  "living 
beyond  one's  means"  cannot  be  the  best  maintenance  strategy,  even  in  a  favorable  environ¬ 
ment,  i.e.,  that  the  optimal  maintenance  rate  is  always  strictly  less  than  the  rate  at  which  the 
machine  generates  revenues,  as  given  in  (16). 

4.  AN  EXAMPLE 

In  this  section,  we  illustrate  the  model  and  results  by  providing  explicit  solutions  for  a 
specific  example.  Consider  a  system  which  fails  when  the  cumulative  damage  first  exceeds  a 
prespecified  threshold  d  (see,  e  g.,  Buckland  [2],  Section  1-10),  so  that  the  failure  probability 
function  pi  )  is  given  by 

(17)  jo  if  0  <  r  <  d 

p{:)  =  {l  if  z  >  d 

which  is  trivially  nondecreasing  and  concave  on  [0,d)  as  per  (the  footnote  of)  Assumption 
P(ii).  Suppose  that  the  shock  rate  \(m)  =  X  >  0,  independent  of  the  maintenance  rate 
mtlO.Ml,  and  that  each  shock  causes  either  zero  damage  (so  that  the  system  survives)  with 
probability  ml M  or  damage  of  magnitude  d  (resulting  in  a  system  failure)  with  probability 
1  —  m/M,  i.e.,  the  distribution  of  damage  caused  by  a  shock  is 

(18)  im/M  if  0  <  y  <  d 
Giy\m)  =  j  ,  jf>  ^  d 

Then,  X(m)  [1  -  Giy\m)\  is  (linearly)  decreasing  in  m,  as  required  in  Assumption  P(i).  We 
may  take  the  economic  parameters  r,  C,  and  C2  to  be  arbitrary  ones  satisfying  Assumption  E, 
although  for  expositional  simplicity  we  take  the  reward  function  r(x)  to  be  strictly  decreasing 
and  convex  in  x  (e  g.,  f  ix)  =  ke  '  with  k  >  0)  and,  to  rule  out  trivial  solutions,  we  suppose 
that 


\ 
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(19)  [/  (0)  -  /(</)]/(«  +\)  =5  C 

The  optimality  equations  (8)  and  (9)  now  become 

(20)  Fix)  =  Max  (F(0)  -  C',. t/(.v )) 
where 

(21 )  (or  +  X  )  U (.v )  =  Max  (/  (x)  —  m  +  A  [  V{x)m/ M  +  (  MO)  —  Cj)  (1  —  m/ M)  1). 

rn  *  IO.A/1 

The  optimal  solutions  F,  8  *  and  82‘  depend  upon  relative  magnitudes  of  certain  problem  param¬ 
eters,  as  given  in  the  following  three  disjoint  and  exhaustive  cases.  In  each  case,  it  can  be 
verified  in  a  straightforward  manner  that  the  given  solutions  satisfy  (20)  and  (21). 

CASE  (i):  C\  -  C,  >  M/K. 


In  this  case,  the  optimal  return  is  the  convex  nonincreasing  function  given  by 


(22) 


[/  (jc )  —  M]/a  if  x  <  x* 
1/  (0)  -  M\/a  —  C |  if  x  >  x* 


where  the  critical  replacement  level  x*  satisfies 


(23)  /  (x*)  =  /  (0)  -  a  C,. 

In  light  of  (19)  and  the  strict  monotonicity  of  /,  we  have  x*  <  d  and  that  x*  is  unique.  As  for 
the  maintenance  rule,  we  have  8,*(x)  =  M  for  all  x€[0,x*),  specifying  the  maximum  mainte¬ 
nance  rate  until  replacement,  since  in  this  case  the  replacement  cost  differential  is  higher  than 
the  maximum  maintenance  cost  until  failure. 


CASE  (ii):  C2  -  Cj  <  M/K  <  C2. 


In  this  case  the  solution  turns  out  to  be 

«£  x  <  x 
<  x  ^  x* 
>  x* 

where  x*  satisfies 

(25)  r(x*)  =  r (0)  -  a  C,  -  M  +  A  (C2  —  Cj) 

and  x  satisfies 


(24) 


F(x)  = 


IHx)  -  M\/a , 

[/  (x)  +  A  [(/•  (0)  -  M)/a 
1/  (0)  -  M)/a  -  C, 


0 

C2]|/(a  +  A ),  x 
x 


(26)  r(x)  =  ;T0)  —  a  \  C 2  ~  M/K], 


Again,  8  *  specifies  replacement  whenever  x  >  x*.  The  optimal  maintenance  policy  8/  is  of  the 
"bang-bang"  type  and  is  specified  in  terms  of  the  switchpoint  x  (which  is  less  than  x*  since 
(C2  -  C i )  ^  M/K  )  as  follows: 


(27) 

82*(x) 


M  if  0  ^  x  <  x 
0  if  x  <  x  <  x*' 


From  (26),  note  that  x  is  increasing  in  C2. 


CASE  (iii)  C'j  -  C,  <  C2  <  M/K. 


In  this  final  case,  we  get 


i 
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r(x)/(«  +  X)  +  X|r(0)/(a  +  X)  —  (',]/«  x  <  x * 
r  (())/<»  —  C ’ |  —  X  (\/<«  V  ^  x * 

where  the  eonlrol  limit  v*  satisfies 
(24)  /  (a*)  =  r (0)  -  (<*  +  X  )  C, 

and  identically  zero  maintenance  rale  (i.e.,  8,"  (x)  =  0  for  all  x€[0.x*))  is  optimal. 

In  all  three  cases.  From  (24),  (25)  and  (29).  we  observe  Ih at  the  optimal  control  limit  x* 
is  decreasing  in  C\  (or  (('.  -  (',))  and  increasing  in  C',.  Similarly,  the  switch-point  x  at  which 
the  optimal  maintenance  rate  switches  from  W  to  0  is  increasing  in  C2  (or  (C>—  C,))  and 
decreasing  in  CY  Thus,  the  higher  the  replacement  cost  differential  (C;  -  C|),  the  greater 
should  be  the  intensity  of  preventive  maintenance  and  replacement  effort.  F:or  a  concrete 
example,  consider  J  =  I ,  r  (  x )  =  I  -  x  and  >  0  fixed.  Then  Figure  1  displays  the 
parametric  behavior  of  the  optimal  critical  value  x*  (shown  by  the  solid  line)  and  the  optimal 
switch  point  x  (shown  by  the  broken  line)  as  the  forced  replacement  cost  C2  is  varied. 

Finally,  notice  that  under  optimal  policy,  in  cases  (i)  and  (ii)  at  most  one  replacement 
ever  takes  place,  while  in  case  (iii)  the  system  is  replaced  at  every  shock.  In  short,  our  analysis 
has  delineated  conditions  under  which  it  will  be  optimal  to  actually  utilize  the  maintenance 
capability  available  for  buffering  the  system  completely  from  shocks. 


(28) 

I  (  v  )  = 


(ct+x)c14 


ocC1  + 


x  =  a(C2-M/X) 


M/X 

I  HA  HI  I 


-  C2 


CT  +  M/X 


5.  CONCLUDING  REMARKS 

In  this  paper,  we  have  integrated  the  problems  of  determining  optimal  preventive  mainte¬ 
nance  and  replacement  schedules  for  a  system  that  is  subject  to  stochastic  deterioration  and 
failure  induced  by  a  shock  process.  Under  reasonable  assumptions,  we  have  proved  that  the 
maximum  obtainable  return  and  the  optimal  policies  have  appealing  features  and  we  have  illus¬ 
trated  these  by  means  of  an  example.  We  conclude  the  paper  by  discussing  some  implementa- 
tional  aspects  of  the  model 
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In  practice,  the  slate  of  the  system  v  may  be  observed  in  terms  of  some  convenient  surro¬ 
gate  measure  of  system  efficiency,  accuracy  or  wear  such  as  the  production  (or  revenue)  rate, 
fraction  defective  produced,  energy  consumption  rate,  etc.  Accounting  and  financial  informa¬ 
tion  may  be  used  to  estimate  the  discount  rate  and  replacement  costs  and  which  depend 
upon  such  economic  factors  as  wage  levels,  prices  and  opportunity  costs  of  lost  production  dur¬ 
ing  replacement  delays.  Statistical  estimation  of  the  parameters  \  and  G  of  the  shock  process 
and  the  failure  probability  p  would  require  observations  on  the  system  performance  together 
with  simulation  experiments.  The  numerical  computation  of  optimal  policy  itself  would  require 
discretization  of  the  performance  space  and  maintenance  rates  m,  so  that  standard  algorithms 
such  as  the  policy  improvement  routine  (see,  e  g.,  Ross  1 1 3])  can  be  employed.  Given  the 
simple  structure  of  the  optimal  policy,  its  implementation  may  be  based  on  a  control  chart  type 

procedure  by  establishing  control  limits  !.v*..v* . vA")  on  the  system  deterioration  level  ,v,  so 

that,  if  v  f  Vi,*  the  system  should  be  replaced;  otherwise  if  v,*  ,  ^  x  ^  ,v,\  it  should  be  main¬ 
tained  at  an  expenditure  rate  m, .  greater  deterioration  corresponding  to  smaller  expenditures. 
The  selection  of  control  limits  may  also  be  based  on  simulation  studies. 
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ABS  TRAC I 

I  his  paper  in  .1  -si.iic-ol -tlic-.irt  review  of  the  literature  related  to  optimal 
maintenance  models  of  systems  subject  to  lailure.  1  he  emphasis  is  on  work  ap¬ 
pearing  since  the  ll>"(>  survey.  "A  Survey  ol  Maintenance  Models  The  Control 
and  Surveillance  of  Deteriorating  Systems."  hy  W  IV  Picrskalla  and  J.A.  Voelk- 
er.  published  in  this  journal 


1.  INTRODUCTION 

Maintenance  involves  planned  and  unplanned  actions  carried  out  to  retain  a  system  in  or 
restore  it  to  an  acceptable  condition.  Optimal  maintenance  policies  aim  to  minimize  downtime 
while  providing  tor  the  most  effective  use  of  systems  in  order  to  secure  the  desired  results  at 
the  lowest  possible  costs.  Proper  maintenance  techniques  have  been  emphasized  over  the  past 
two  decades  due  to  increased  complexity  of  systems,  increased  quality  requirements  and  rising 
costs  of  material  and  labor.  The  two  old  concepts  of  maintenance:  loving  care  (the  reliability  of 
the  equipment  is  directly  proportional  to  the  frequency  of  maintenance),  and  emergency 
replacement  (operate  equipment  until  it  is  inoperable)  may  not  be  optimal.  Both  methods  lead 
to  improper  maintenance,  excessive  breakdowns,  and  high  costs.  Since  the  1965  and  1967  sur¬ 
veys  on  maintenance  by  McCall  and  Barzilovich  |3 15,34],  a  great  deal  of  research  has  been 
done  in  the  field  of  optimal  maintenance  modeling,  involving  the  aspects  of  optimal  preventive 
and  preparedness  maintenance  policies.  Tables  1-3  give  the  references  in  various  classifications. 
Some  references  appear  more  than  once  in  Table  1  because  these  papers  consider  two  or  more 
topics  related  to  maintenance  models.  Also,  some  papers  are  not  referred  to  in  Table  2  because 
the  topics  of  these  papers  were  not  concerned  with  any  specific  model  type. 

2.  OPTIMAL  MAINTENANCE  MODELS 

The  literature  related  to  optimal  maintenance  models  is  classified  as  follows: 
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Optimal  Maintenance  Models 

I  Deterministic  Models 
2.  Stochastic  Models 

A.  Under  Risk 

B.  Under  Uncertainty 

1.  Simple  System 

2.  Complex  System 

a.  Preventive  Maintenance  (periodic,  sequential) 
or 

b.  Preparedness  Maintenance  (periodic,  sequential, 
opportunistic) 

Optimization  techniques  employed  for  obtaining  optimal  maintenance  policies  include  the 
following: 

l  inear  programming 
Nonlinear  programming 
Dynamic  programming 
Pontryagin  maximum  principle 
Mixed-integer  programming 
Decision  theory 
Search  techniques 
Heuristic  approaches 

The  characteristics  of  each  optimal  maintenance  model  considered  in  this  survey  will  be 
explained  briefly. 

2.1  Deterministic  Models 

These  models  incorporate  the  following  assumptions: 

•  The  outcome  of  every  maintenance  action  is  nonrandom. 

•  Maintenance  action  restores  the  system  to  its  original  state. 

•  The  purchase  price  and  salvage  value  of  the  system  are  taken  as 

given  functions  of  its  age. 

•  Aging  (were  and  tear)  increases  the  costs  of  operating  the  system. 

•  Aging  failure  is  not  necessarily  operational  failure. 

•  All  failures  are  new,  and  can  be  observed  instantaneously. 

•  By  prolonging  the  operating  life  of  the  system  through 

maintenance,  costs  are  incurred  and  benefits  may  increase. 

The  optimal  maintenance  policy  for  deterministic  models  is  periodic  and  the  times 
between  successive  maintenance  actions  must  be  equal. 

2.2  Stochastic  Models  Under  Risk 

Risk  is  a  time-dependent  property  that  is  measured  by  probability,  for  stochastically  fail¬ 
ing  equipment  under  risk,  it  is  impossible  to  predict  the  exact  time  of  failure;  but  the  distribu¬ 
tions  of  the  time  to  failure  of  each  component  of  the  system  are  known. 
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2.2.1  Simple  System  Preventive  Maintenance  Model  (periodic,  sequential) 

I  Ins  model  utilizes  the  hollowing  assumptions: 

•  I  he  system  time  to  failure  is  a  random  variable  with  known 

distribution 

•  l  he  system  is  either  operating  or  failed. 

•  I  ailure  is  tin  absorbing  state. 

•  Maintenance  action  regenerates  the  system  immediately  upon 

completion. 

•  I  he  intervals  between  successive  regeneration  points  are 

independent  random  variables. 

•  lhe  maintenance  cost  is  generally  higher  if  undertaken  alter 

an  operational  failure  than  before. 

The  optimal  policy  for  various  assumptions  is  as  follows: 

•  I  or  systems  with  unlimited  lifetime,  the  optimal  preventive 

maintenance  policy  is  the  strictly  periodic  one  — i.c.. 
maintain  system  at  failure  or  at  an  age 
whichever  occurs  first. 

•  I  or  systems  with  constant  failure  rate  (exponential):  maintain 

at  failure. 

•  for  systems  with  increasing  failure  rate  (Wcilbull.  gamma . 

etc.,  lor  some  parameter  values),  maintain  on  progressive  schedule. 

•  for  systems  with  limited  lifetime  (process  with  a  relatively 

short  lifetime,  or  equipment  subject  to  rapid  technological 
change),  the  best  preventive  policy  is  the  sequential  one. 

Ibis  sequential  policy  recalculates  the  maintenance  age 
i  after  each  overhaul  It  actually  attempts  to 
minimi/e  the  expected  cost  of  sy  stem  operation  over  the 
remaining  life  of  the  process. 

2.2.2  Simple  System  Preparedness  Maintenance  Model  (periodic,  sequential) 

This  model  utilizes  the  following  assumptions: 

•  The  system  time  to  failure  is  a  random  variable  with 

known  distribution 

•  The  actual  state  of  the  system  is  known  with  certainty 

only  at  the  time  of  inspection  or  maintenance. 

•  f  ailure  is  an  absorbing  state 

2.2.3  Complex  System  Preventive  Maintenance  Model  (periodic,  sequential, 
opportunistic) 

This  model  is  an  extension  of  2.2.1  for  complex  systems.  The  optimal  policy  lor  various 
assumptions  is  as  follows: 

•  If  the  parts  constituting  the  complex  system  are  interconnected 

in  such  a  way  that  they  can  be  considered  as  stochastically 
and  economically  independent,  then  the  optimal  maintenance 
policy  for  this  complex  sy  stem  reduces  to  that  of  the  simple 
system,  i  e..  employ  a  periodic  or  sequential  preventive 
maintenance  policy  for  each  separate  part. 
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•  II  individual  parts  unnni  he  considered  as  stochastically  and 

economically  independent,  then  a  policy  called  the 
oppiirtuntstic  maintenance  policy  will  he  more  effective.  tinder 
this  policy  ,  the  maintenance  of  a  single  uninspected  part 
depends  on  the  state  of  one  or  more  continuously  inspected 
parts  I  he  opportunistic  maintenance  policy  is  advantageous 
when  the  cost  of  a  joint  maintenance  action  is  less  than  the 
sum  of  the  cost  of  the  separate  maintenance  actions. 

•  II  a  complex  system  is  composed  of  a  large  collection  of 

identical  units  of  equipment,  then  a  block  maintenance 

policy  may  he  advantageous  Under  this  policy,  each  unit 

is  replaced  on  failure,  and  all  units  are  replaced  at 

periodic  intervals.  /.  2  7'.  3  7'.  ....  without 

regard  to  individual  unit  age.  Scheduled  and  unscheduled 

maintenance  can  he  combined.  Consequently ,  this 

policy  is  easier  to  implement,  and  results  in  lower  administrative 

and  maintenance  costs. 

2.2.4  Complex  System  Preparedness  Maintenance  Model 
(periodic,  sequential,  opportunistic) 

I  Ins  model  is  an  extension  of  2.2.2  for  complex  sy  stems.  The  optimal  policy  for  various 
assumptions  is  as  follows: 

•  If  the  complex  system  is  under  continuous  surveillance. 

then  this  model  reduces  to  the  preventive  maintenance 
model  described  under  2.2.3. 

•  If  the  complex  sy  stem  is  not  inspected,  then  the  only 

maintenance  policy  to  secure  the  highest  level  of 
preparedness  is  replacement. 

2.3  Stochastic  Models  Under  Uncertainty 

I  or  stochastically  failing  equipment  under  uncertainty,  the  exact  time  of  failure  and  the 
distribution  of  the  time  to  failure  are  not  known. 

2.3.1  Preventive  Maintenance  Model  for  Simple  and  Complex  Systems 

The  optimal  policy  for  various  assumptions  is  obtained  as  follows. 

•  When  the  system  is  new  or  failure  data  arc  not  known, 

the  minimax  techniques  are  applied. 

•  When  information  about  the  system  (failure  rate . 

etc.)  is  partially  known.  Chebyshev-type  bounds  are  applied. 

•  When  subjective  beliefs  about  the  system  failure  are  known. 

Bayesian  adaptive  techniques  are  applied. 

2.3.2  Simple  (complex)  System  Preparedness  Maintenance  Model 

The  techniques  of  minintax  strategies.  Chebyshev-type  bounds  and  Bayesian  adaptive  poli¬ 
cies  can  be  applied  to  this  model  as  explained  under  item  2.3.1. 
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I  AIM. I-!  1  General  Classification 
Type  _ References 


Inspection 


Maintenance 


Reliability 


Optimization  Techniques 


Decision  Theory 


24,  24,  TO.  XI,  104,  106, 
241,242,  251.  253,  254, 

254.  263.  2X0.  247.  337. 

434.  435.  475.  4X2,  44X.  51  X 

I.  5-10.  13.  15.  16.  24. 
31-37.  41.  44-5X,  60-70. 
74-X2,  X4-47.  101-106. 
115-120.  121-131.  146, 
154-161,  161-164.  172. 
177-1X3,  143,  202.  205 
211.  224,  233.  23X. 

242- 252.  255-257,  266-277. 
2X3-240,  245-301.  304, 
315-324.  334-348.  354-3X2. 
3X4-403,  408-421.  433.  438. 
444-452,  470-475.  4X2-444, 
502-513.  516 

2-8.  11-16.  21-24.  38.  34. 
44-71,  77-42.  108-114. 
123-133.  147-156.  162-184. 
147-213.  221-238.  241-245. 
258-271.  277-244.  303-318. 
314-317.  326-347.  352-364. 
372-376.  3X5-342.  348.  40X. 
414.  431-438.  450-464, 
494-500.  506.  511,  515-522 

5.  13.  18,  42.  103.  140, 

141.  188.  145.  147.  214. 

244.  248.  252-256.  357-367. 
383.  418.  442.  480.  4X7, 
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15-20.  26-33.  34-51.  73-40. 
44-117.  134-152.  158-162. 
140-201.  217-228.  230-237. 

243- 251.  255-260.  273-276, 

244- 247.  307-322.  336-340. 
355-360.  404-413.  422-430. 
434-443.  455-464.  475-484. 
512-516.  523.  524 
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I  Mil  l  2.  ( lussi/icainm  a/ 
ly  pc 

Deterministic 

Stochastic.  Inder  Risk 
l’re\ entice 


Preparedness 


<  )ppiiilumstie 

Stochastic,  Inder  Uncertainty 


Maintenance  Models  by  Type 
Relerenees 

77 76.  41.  42.  170.  1 2X. 
234.  264.  3 7 5 ,  36s.  366. 

3 SO.  3X4.  304.  39s.  401. 
442.  4X7 

4.  6.  13.  Is.  17.  21. 

22-29.  44-5  1 .  69-72. 

73-Xtl.  1  14-116.  1  19-124. 
138-152.  !  7s- |X4.  225-240. 

251.  261-2X0.  323-352. 
364-400.  441-4X6.  513-516 
10.  32.  43.  56.  73.  XI.  92. 
100.  104.  106,  170.  179. 
211.  213.  237.  241.  243, 
253.  259.  297-300.  324. 
33X.  342.  343.  369.  403. 
411.  4.3.5.  469.  4X5 

19.  47.  199,  316.  353-355, 
3X4.  392.  402.  437.  4X3. 
493.  511.512 
10.  ix.  34.  43.  75.  86.  90. 
100.  104,  105.  117.  1  IX. 
125.  126.  132.  157.  167. 
173,  185.  195.  209.  236. 

252.  255.  373.  417.  418. 
422.  433.  436 


I  ABU.  3.  C  iussi/ieaiion  by  Type  of  Applicable 
Optimization  Techniques 


Technique _ 

Decision  Theory 


Dynamic  Programming 

Linear  Programming 
Mixed-Integer  Programming 
Nonlinear  Programming 
Pontryagin's  Maximum  Principle 

Search  Techniques 
Simulation  Techniques 


Relerenees 

1-4.  6^17^14-39,  43-139. 
142-190.  i  92-2 1 8.  220-226, 
228-243.  245.  249-251. 
259-337,  359.  361-370. 
372-376,  385-387.  390-396. 
400-441.  443-460.  463-479. 
4X1.  483-496.  500-524 
40-42.  191,  199,  219.  244. 
246.  252.  255-258.  360.  442. 
462.  480.  488 
103.  271.  497 
384-497 

191.  227.  247.  248.  257 
5.  13.  140.  141.  389.  447 
482 

339.  37).  377 
374.  397,  499 
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( oncerning  the  rel erenee  list,  we  have  Hied  to  he  reasonably  complete.  however  those 
papers  which  were  not  included  were  either  considered  not  to  hear  directly  on  the  topics  ol  tins 
survey  or  inadvertently  overlooked  We  apologi/c  to  both  the  researchers  and  readers  t!  we 
have  omitted  any  relevant  papers. 

W e  woulil  like  to  thank  the  editor  ol  the  NRI Q  and  the  releree  lor  then  excellent  and 
exhaustive  review 
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ABSTRACT 

Problems  of  bounding  Pr  |.V  >  >1.  when  Ihe  distribulion  of  A  is  subject  to 
certain  moment  conditions  und  the  distribution  of  >  is  known  to  be  of  convex- 
concave  type,  are  treated  in  the  framework  of  mathematical  programming 
Juxtaposed  are  two  programming  methods:  one  is  based  on  the  notion  of  weak 
duality  and  the  other  on  the  geometry  of  a  certain  moment  space 


INTRODUCTION 

Let  X  and  its  cumulative  distribution  function  F()  represent  the  strength  variation  of  a 
certain  system,  and  let  Y  and  its  c.d.f.  G()  represent  the  variation  of  the  stress  to  which  the 

Jslem  is  subject.  When  X  and  Y  are  statistically  independent,  the  quantity  R  =  Pr  |2f  >  E)  = 
G(t)  dF(t)  is  commonly  referred  to  as  the  reliability  of  the  system.  The  problem  of 
estimating  R  has  been  addressed  in  the  literature  in  a  variety  of  different  contexts.  Among 
such  contributions  are  Birnbaum  and  McCarty’s  |2]  nonparamelric  procedures  for  confidence 
intervals,  Govindarajulu's  (7)  improvement  on  Birnbaum  and  McCarty’s  work  via  asymptotic 
normality  of  R.  Church  and  Harris’  [4]  parametric  procedures  for  UMVU  estimation  and 
confidence  intervals,  Enis  and  Geisser’s  [6l  Bayesian  inferences  on  /?,  and  Bhattacharyya  and 
Johnson's  111  generalization  to  multicomponent  systems.  In  this  study,  we  are  interested, 
rather,  in  bounding  R  with  respect  to  F  for  fixed  G,  and  in  determining  the  corresponding 
extremal  distributions  F*. 

We  note  that  this  problem  is  a  slight  modification  of  the  classical  variational  problem 
underlying  the  Tchebycheff  inequality;  we  need  only  replace  G  in  the  expression  for  R  by  a 
fixed  symmetric  set  characteristic  function,  and  then  maximize  R  subject  to  given  values  of  the 
first  two  moments  of  X.  We  note  as  well  that  both  problems  are  special  cases  of  what  in  Karlin 
and  Studden  (91  (Ch.  XII),  are  called  "generalized  Tchebycheff  problems,"  which  are  treated 
there,  essentially,  by  the  duality  theory  of  linear  programming. 

The  fact  that  problems  of  the  Tchebycheff  type  can  be  solved  effectively  in  the  framework 
of  linear  programming  theory  has  also  been  documenied,  for  example,  in  lsii  |8l.  Whittle  [17], 
and  Pyne  [131.  In  this  paper,  we  treat  the  optimization  of  R  through  programming  approaches, 
which,  though  kindred  in  spirit  to  the  above,  do  seem  to  be  especially  well  tailored  to  our 
problem,  when  the  further  assumption  is  made  that  G  is  "strictly  unimodal,  i.e.,  is  strictly  con¬ 
vex  to  the  left  of  some  point,  and  strictly  concave  to  the  righl. 

•This  research  was  partial!)  supported  by  the  Air  lorce  Office  of  Scientific  Research,  through  Grant  #78-3518  to 
Iowa  Stale  l  Diversity 
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I  ho  Inst  approach.  essentially  a  linear  specialization  of  (he  weak  duality  argument  of 
David  ami  kirn  Dl.  ictommcnds  itself  for  ils  simplicity,  but  fails  when  extremal  distributions 
Jo  not  exist  I  ho  second  approach,  based  on  the  geometry  of  a  certain  moment  space,  does 
succeed  in  such  situations,  hut  is  less  direct  We  note  in  passing  that  Brook's  |3|  bounding  of 
moment  generating  functions  offers  still  a  third  programming  alternative  for  the  optimization  of 
l<  In  Sec  2.  we  outline  the  first  method  in  conjunction  with  a  certain  pair  of  linear  programs 
l and  the  second  method  in  conjunction  with  a  certain  geometrically  motivated  program 
I’ll.  Sec  a.  devoted  to  the  first  method,  illustrates  the  construction  of  extremal  c.d.f.'s.  in  the 
context  of  two  simple  examples  Sec  4  illustrates  the  second  method,  using  the  examples  of 
Sec.  a 

2.  FINFAR  PROGRAMMING  FOR  Ml  I.ATIONS 

Detine  the  following  classes  of  functions: 

&  The  class  of  "generalized  c.d.f.'s"  /  of  the  form  cl".  where  i  ?  0  and  /"a  c.d.f.  on  the 

line 

T  he  class  of  discrete  c.d.f.'s  /  on  the  line  with  at  most  »+ 1  jumps. 

The  class  of  c.d.f.'s  O' on  the  line  (hat  are  strictly  convex  to  the  left  of  0.  and  strictly  con¬ 
cave  to  the  right 

The  class  of  c.d.f.'s  G  on  the  line  that  are  strictly  concave  on  |0.  °°l  and  identically  0  oth¬ 
erwise. 

further,  we  assume  that  a  <7  €  ^possesses  probability  density  function 

*<•>. 

for  a  given  //-tuple  //(/)  S  (//,</) . //„</)),  where  each  /»*.</)  is  a  piecewise  continu¬ 

ous  function  on  the  line,  let  C'//|/t(/:j)|  denote  the  convex  hull  generated  by  hit )  when  we 
vary  t  over  the  line,  for  a  given  point,  b  —  <b|,  . . .  ,  b„)  €  ( 'll  I  //  (L] )  I,  we  formulate  a  linear 
program. 


Pi  maximize  J 

1  GU)  dru) 

(2.1a) 

subject  to  J 

f  dun  =  i 

(2.1b) 

J 

hkU )  Jl  G)  =  bk. 

1  ^  A  <  // 

(2.2) 

and  /•  €  & 

where  G  is  a  fixed  c.d.f.  in  ^ 


We  note  that  the  underlying  space  JRn  (2.2).  being  free  from  normalization,  is  a  convex  cone. 
Fxploiting  the  cone  structure  of  both  (2.1)  and  (2.2)  in  conjunction  with  standard  dual  cone 
theory  (Luenberger  (10).  (p.  157),  and  Sposilo  |I5).  (p.  261)),  we  may  write  down  a  linear  pro¬ 
gram  formally  dual  to 

I),:  minimize  A  fi1 

subject  to  A  tj  (/)'  >  (Hr).  V  t  € 

and  A  =  (An.  A i . A„)  €  . i . 

where  fi  =  ( I  .h  )  and  tj  </)  s  (!.//</)). 


(2.3) 

(2.4) 
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Note  first,  by  (2.1)  and  (2.3),  that,  if  is  feasible  for  P:  and  A°  is  feasible  for  D,,  then 
(2.5)  J  GU)  rff’tr)  <  J  A"r)"(/)r  dP'it)  =  A"  fi1 . 

Iherefore,  (as  suggested  in  David  and  Kim  15),  and  Pukelsheim  1 1 2 1 ) ,  if  we  find  a  feasible 
solution  pair  (/  *.  A’)  that  satisfies 

(2  b)  j  [A*  r,  <  / ) y  -  (7(r  )|  (//Mr)  =  0. 

then  <  / ’.  A*)  is  in  fact  an  optimal  pair  for  (/’,./>,).  and,  certainly,  /■*  satisfying  (2.6 )  will  need 
to  concentrate  its  mass  on  the  "set  of  osculation" 

<2.7>  /'(am  =  1 7  e  /:  1 1  A  •  ?/ 1  r  ) 7  6 <r )  =  0), 

whose  cardinality  is  bounded  usually  by  n  +  2,  when  the  functions  { Git),  rj(r)|  are  linearly 
independent  on  the  line.  (See  Karlin  &  Studden  [9].)  Sec.  3  contains  detailed  demonstrations 
on  how  to  construct  an  extremal  c.d.f.  /'*  (which  turns  out  to  be  supported  at  only  n  points  in 
one  example,  and  (n  —  I )  points  in  another  example). 

Now.  with  special  reference  to  the  second  approach,  consider  the  following  program  P}, 
an  essentially  finite  dimensional  version  P/. 

Pi :  maximize  f  GU)  ill  U) 

subject  to  J*  lijt)  ill  U)  =  hk.  I  <  A  ^  n 

and  /  € 

The  fact  that  Pt  and  Pi  yield  the  same  optimal  value  follows  from  the  general  considerations  in 
Rogosinski  1 1 4|  and  Mulholland  and  Rogers  111).  The  reduction  of  P,  to  P\  provides  a  useful 
geometric  version  of  our  problem,  in  that  the  class ^  of  P)  generates  the  convex  hull  C7/|T] 
of  the  trace 

T  =  |  a  =  (a, . v„,  ,v„+ 1 )  I  a*  =  //*(/).  1  <  A  <  //,  and 

v„.|  =  GU).  for  sonic  i  €  /:). 

Hence,  we  are  led  to  the  equivalent  program 
I’ll  _  SUp  A„.| 

where  (  =  (  //H  I.  andj^.  =  |a  l.xy  =  hk.  I  <  k  n.  ,\,M ,  6  /',). 

See  Van  Slyke  and  Wets  1 1 6 1 .  and  Pyne  1 1 31  for  similar  constructions. 

Since  the  set  (  in  is  convex,  the  optimal  value  v,%,  of  Plt  may  be  obtained  by  asso¬ 
ciating  this  value  with  a  suitable  supporting  hypcrplane  Hh  of  C  at  the  boundary  point 

Hi i . h„ ,  ,v*.|).  finding  the  equation  for  llh  is  not  easy,  however,  since  C  is  known  only 

through  T.  In  Sec.  4  the  problem  of  finding  Hh  is  attacked  by  considering  Hh  as  a  certain  limit 
of  all  hyperplanes  in  |  that  cut  or  touch  the  set  T. 

3.  II  I. I  STRATION  OK  THF.  FIRST  APPROACH 

l.XAMPl.l.  I.  We  wish  to  find  the  maximum  reliability  K *  of  a  system  whose  strength 
distribution  /  is  known  to  have  mean  0  and  variance  h  >  0,  when  the  distribution  of  the  stress 
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lo  which  the  system  is  subject  is  given  by  a  known  continuous  c.d.f.  <7  in  We  compute  R  * 
via  constructing  of  an  extremal  c.d.f.  F*  (Remark:  There  is  no  loss  of  generality  in  fixing  the 
common  value  of  the  mean  of  Fand  the  mode  of  (7  at  zero.  If  their  common  value  is  in  fact, 
say,  a  positive  value  M ,  then  the  corresponding  F**  is  obtained  by  shifting  F*  obtained  below  to 
right  by  M.)  Specializing  the  program  pair  (F/.D,)  of  Sec.  2  to  this  problem,  we  find  the  pro¬ 
gram  pair 

F/,:  maximize  J  GU)  dF(t) 

subject  to  J"  dFU )  =  1 

(.1.1)  J  i  dFU)  =  0 

(.1.2)  J  r  dFU)  =  b. 

and  F  € 

I)//:  minimize  A(>  +  k2  b 

(.11)  subject  to  Ad  +  k i  /  +  k2t2  ^  G U ),  Vr  €  E\. 

and  A  -  (A„.  A |.  A>)  €  Ft. 

The  osculating  set  HA*)  of  Sec.  2  now  is  the  set  of  r's  where  the  parabola  F(r)  =  A,*  + 
A  +  k*r:  lying  above  the  "convex-concave"  function  6'(r)  touches  G'(r).  At  such  r.  the 
derivatives  F’(r)  and  G’Tr)  must  coincide,  and.  since  F'(r)  is  linear  and  G'(t)  is  either 
"increasing-decreasing"  if  G  €  or  "identically  zero-decreasing"  if  G  €  c£>.  there  can  be  at 
most  two  such  r's.  We  recall  from  Sec.  2  that  the  spectrum  of  F*  must  be  contained  in  T(k*). 
Hence,  in  view  of  restriction  (3.2),  the  spectrum  of  F*  consists  of  exactly  two  points  s  and  t 
(with  respective  weights  />  and  <  1  — /j ) ) ,  which,  in  addition,  must  be  of  opposite  sign  in  view  of 
restriction  (.1.1).  say.  s  <  0  <  r 

Pooling  all  our  findings  and  restrictions,  we  write  down  the  following  nonlinear  relations 
in  the  six  unknowns  s.  i,  />.  An.  A  *.  and  A  *: 

t  .1.5)  \  p  +  /  ( I  —  p )  =  0 

( .16)  s'/)  +  r'(  I  -/> )  =  h 

(.17)  A n  +  A  * r  +  A  *r '  =  (Hr),  r  =  s. I 

(.1.8)  Af+  2A?r  =  .girl,  r  =  s.t 

1. 1.9)  A  *  >  0. 

Moreover,  the  optimality  condition  (2.6)  adds  the  further  requirement 
(3.10)  G(s)  p  +  G(t)  ■  (1  -p)  =  A  ,*  +  A  * 6. 

Solving  (3.5 )-( 3  10)  for  the  six  unknowns  reduces  to  finding  a  positive  I  (and  negative  s  = 
6//)  satisfying 

(111)  l/2U'0)  +  mi-b/t)  I  it  +  b/i  I  =  GU)  -  G(-b/i). 

for  G  (  '§  ,  relation  (3  11)  implies  that  i  should  be  chosen  such  that  the  area  under  the  den¬ 
sity  ,c’(  )  between  -  bit  and  i  equals  the  area  of  the  trapezoid  formed  by  the  four  points 
|(  bit.  0).  G  bit,  .g(-/>/2)).  G.gG)),  (r.0)|.  for  G  (■  $ < ,  it  turns  out  as  well  that  we  are  to 
equate  the  area  under  c (  )  between  0  and  t  with  the  area  of  the  triangle  formed  by  points 
!(  hit.  0).  <U’G)).  (/.())). 
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/  V  \ Mi'Ll  2  Hero  we  consider  a  slighi  modification  of  Example  I;  i.e..  the  resiriciions 
(3.1  >  and  (3.2)  are  replaced  respectively  by 

(3.12)  f  hint)  =  />, 

(313)  f  I/I  tint)  =  />. 

The  osculating  set  /  ( A  * )  of  Sec.  2  now  is  the  set  of  r's  where  the  wedge  H  h)  —  A  + 
A  f-  +  A  f  1/1  lying  above  (»  (t)  touches  (Hr).  In  view  of  the  strict  concavity  of  Ci(-)  for  t  ^ 
0  and  the  specification  of  /  (A*)  above,  the  set  /'(A*)  is  reduced  to  a  certain  nonnegalixe  sin¬ 
gleton.  which  fact,  in  turn,  implies  that  we  should  coniine  our  search  for  an  extremal  /■*  to  the 
class  of  degenerate  c.d.f.'s.  However,  since  those  /  ’ s  that  satisfy  (3.1  I )  and  (3.12)  with  b\  ^ 
/>-  cannot  be  degenerate,  the  weak  duality  method  requiring,  as  it  does,  the  existence  of  an 
extremal  c  el  l.  /  *  is  not  applicable  in  this  case.  Only  in  the  trivial  case  />  1  =  />;  >  0.  can  we  fix 
/  '  through  /  (A* );  i.e..  by  weak  duality. 

4.  II  I. I  SI  RATION  OF  THF  SF.COM)  APPROAC  H 

/.A  I  A//'/./.'  a.  (Fxample  2  of  Sec.  3)  Specializing  of  the  formulation  of  I’n  in  Sec.  2  yields 

where 

C  =  C  //II  I.  I  -  { a  l.v,  =  /.  A.  =  (/).  and  A,  =  (HD. 

(4.1 )  some  /  €  /:',) 
and 

(4.2)  li .  =  |.v l.v  1  =  A|,  ,v>  =  by.  and  a  -,  €  A||. 

Since  no  triple  of  distinct  points  of  F  can  be  colinear.  any  such  triple  determines  as  hy  per¬ 
plane  that  cuts  or  touches  F.  The  idea  of  our  second  approach  is  then  to  Find  a  "best  triple"  that 
yields  the  highest  hyperplane  at  h  —  (/?,./> : )  among  the  collection  H  of  all  "qualified"  triples  ir. 
In  what  follows,  using  the  unimodalily  of  (i.  we  are  able  to  reduce  It  to  the  collection  I  of 
"qualified"  pairs  1  of  distinct  points  in  I  . 

To  see  this  in  detail,  we  first  introduce  the  notation  1 1  (.S')  for  the  projection  of  the  set  S' 
into  the  plane  vs  =  0  Now  partition  F  into  F  and  I",  where  F  is  the  left  side  of  F. 
corresponding  to  1  <  •).  and  I  '  is  the  right  side  of  F.  corresponding  to  /  >  0.  and  define 

It  =  lulu  satisfies  the  condition  that  h  f  ('//111  Hr 4)1. 

It  '  =  {it  in  f  It.  and  any  two  of  the  three  points  of  11  in 

I",  with  the  remaining  point  in  F  ].  and  It  =  It  -  It  '. 

Also,  define  /1  <»  ;/>)  =  height  at  b  of  the  hypcrplane  determined  by  a  triple  m.  (Note  that  I’u,, 
is  equivalent  to  lind  sup  /)( it  :/>).) 

I  sing  essentially  the  convexity  of  (»'(/ )  for  t  <  0.  it  can  be  demonstrated  that 

1.1  MM  A  I  I  or  any  triple  ir  <  II  .  there  is  a  triple  n't  It  '  such  that  liiw'.h)  ^ 

//  ( 11 :  /» ) .  Hence.  sup//(n,/>)  =  sup  li(n.b).  Next,  we  deline  the  collection  of  pairs 
«  „  •  11 


(,  KIM 


so 

l  =  jvlonc  point  of  v  in  I'  and  the  other  point  in  v 
is  I'  ‘ .  and  also  b  >  (  //  |l  I  <  v )  || 
l  snip  the  stint  concavity  of  (Hi)  for  i  >  0.  we  find 

LIMMA  2  sup  hlw.b)  —  sun  //  ( v  ;/> ). 

It  O  I 


lo  evaluate  the  right  hand  side  of  the  above,  we  need  an  explicit  expression  for  h  ( v;/> ) : 
Parametrizing  bv  the  < acute >  angle  n  between  IIU  //|r|)  and  11(1").  and  redefining  h  accord¬ 
ingly  .  we  write 

li  to  ,h  I  =  (H—f>  -  it  tan  a )  8/(8  +  ir  tan  a  ) 

(i  (it  +  8  cot  a  )  <r  tan  a/ (8  +  it  tana). 


w  here 


8  -  I  2 lb  />  !.  it  -  I  2l/>;  +  />(),  and  a  £  < < r .  rr/2). 

\  further  reparanietri/ation  by  /i  (8  +  <r  tana),  (and  redefining  li  accordingly),  yields 
(4  a  l  li  (/i  :b  I  />(i  (  8/  /» )  +-  (I  /> )  (Hit/  ( I  /;)./»  (  (0.1). 

With  the  expression  (4 ..»)  for  /;.  the  monotonicity  of  (i  allows  the  conclusion  that 
slip  /;  </>./>(  (Hit). 


Moreover,  if  h  *  />-.  then  there  tloes  not  exist  an  extremal  c.d.f.  achieving  the  optimal  value 
(Hit  i  of  corresponding  to  the  fact  that  I  is  not  bounded.  If  /> t  =  b?  >  0.  however,  there 
is  an  extremal  c.d.f.  1  degenerate  at  />i>  achieving  (/Or) 


/.  \  IMI'I.I.  h  <1  xample  I  of  Sec.  a)  In  this  case.  (4.1)  and  (4.2)  of  It. sample  a  are 
replaced  by 

(4.4)  I'  =•  !.v  v ;  =  /.  .vs  =  r.  and  v ,  =  (Hi):  some  /  €  /.' , ) . 

(4  5)  J  «•  !.v  ;.V|  -  0.  a  >  />.  and  v  i  =  £  /:' | ! . 


following  the  analogous  argument,  we  reparametri/e  v  £  I  the  modification  of  I  per¬ 
tinent  to  <4  41  mil  (4  s).  bv  i/r|().7r|.  where  the  angle  o  is  between  the  line  extending 
1 1  ( (  // 1  v  | )  and  the  v  |  -axis 

for  given  h  ~  (()./>).  r>y  letting />  =  1/2  tan  <).  we  find 

1 1  •  /i . b )  =  (Up  +  iip)i  !(/(/»)  -  />)/ 2 /■(/»> } 

( 4  h )  +  (Hp  rip))  ■  J  (/  (/))  +  /i )/ 2 /(/>)} , 

where  /  Ip)  -  </>-’  +  A)1 

It  is  easy  lo  c  heck  that  hlp'.h)  is  concave  on  (-  *>.0|  and  convex  on  (0,  °°>  jp  view  of  the  fad 

that  (i  (  )  ’  fi  Differentiating  (4.6)  with  respect  to  p  and  setting  it  equal  to  0  yields 

(4  "  I  c  Ip  *  /  Ip  ) )  f  y  Ip  ilp))  -  \(Hp  +  rip))  --  (Up- r(p))]/ r(p), 

which  is  to  be  solved  for  the  p'  in  (  <».()|  that  maximized  (4.6).  We  note  that  (4.7)  does 
reduce  to  (a  II)  with  /  —  p  4  rip). 
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ABSTRACT 

In  discounted  Markov  decision  processes  hounds  on  the  optimal  value  func¬ 
tion  can  be  computed  and  used  to  eliminate  suboptimal  actions.  In  this  paper 
we  extend  these  procedures  to  the  generalized  Markov  decision  process  In  so 
doing  we  forfeit  the  contraction  property  and  must  base  our  analysis  on  other 
procedures  Duality  theory  and  the  Pcrron-f robenius  theorem  are  the  main 
tools 

I.  INTRODUCTION 

In  this  paper  a  finite  state  and  action,  infinite  horizon,  generalized  Markov  decision  pro¬ 
cess  consists  of  a  finite  set  of  s  states  denoted  by  -S';  a  finite  set  of  actions  A ,  for  each  /  €  S:  an 

immediate  reward  c,"  for  each  /  €  .S'  and  a  6  A,  and  a  weighted  "generalized  probability"  for 

S 

each  /.  /  €  ,S  and  a  €  A,.  Let  A  =  .V  .-I,  denote  the  set  of  decisions.  F:or  8  f  i,  c*  refers  to 

i=i 

the  s  x  1  reward  vector  where  c/*1"  is  the  immediate  reward  for  using  action  SO)  in  state  /  and 
Ph  is  the  s  x  ,v  generalized  probability  matrix  associated  with  using  decision  S.  A  generalized 
Markov  decision  process  requires  that 

1.  Pf,  >  0  for  each  S  €  A 

2.  p(Pb)  <  1  for  at  least  one  8  €  A 

3.  D  =  | v:v  ^  Pf,\t  +  c\  8  €  A|  *  0 

where  p(P)  is  the  spectral  radius  of  the  square  matrix  P. 

Lei  j£,U)  and  J^U)  be  defined  over  R'  where 
J^,(v)  =  P,v  +  c* 

and 

l£(\)  =  U-MaxX(v) 

h »  A 

where  k  -Max  means  vector  maximization.  Since  each  Ps  W  isotone  ( i.e.,  ,v  >  i  implies  Ph.\  > 

P*yi.  and  Jtfare  accordingly  isotone.  Notice  that  may  not  be  a  contraction  mapping  or 
even  an  /V-stage  contraction  mapping  and  thus  may  not  possess  a  unique  fixed  point  [2].  Since 
I)  0,  it  is  easy  to  show  that  jZ’nas  at  least  one  fixed  point  |7,8l.  Let  f  =  (v:v  =  Jjftv))  be 
the  set  of  fixed  points  of  J*  We  wish  to  solve 

•'this  research  was  performed  when  the  author  was  affiliated  with  the  School  of  Industrial  l.ngmeering,  Purdue  Univer¬ 
sity.  West  t  afavette.  Indiana 
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v*  =  V  -Min  v. 

/ 

This  problem  is  well  defined  and  is  motivated  in  [7,8] . 

Such  problems  were  studied  in  [7 ]  as  a  generalization  of  I8i  and  encompass  traditional 
discounted  Markov  decisions  (6 1 ,  the  discounted  processes  investigated  by  Veinott  [17]  and  the 
more  general  processes  resulting  from  the  duals  to  linear  programs  with  (hidden)  Leontief  Sub¬ 
stitution  Systems  and  (hidden)  essentially  Leontief  Substitution  Systems.  The  latter  two  cases 
include  such  applications  as  completely-ergodic  nondiscounted  Markov  decision  processes  [9], 
shortest  path  problems  (with  or  without  cycles),  and  the  stopping  model  of  Denardo  and  Roth- 
blum  [3 1. 

It  has  long  been  known  in  the  context  of  the  traditional  discounted  Markov  decision  pro¬ 
cess  [10,12,13,15]  and  more  recently  in  the  discounted  processes  of  Veinott  [12]  that  bounds  of 
the  form  /  <  v*  <  u  can  be  constructed  and  used  to  eliminate  inferior  actions  from  further 
consideration  as  potential  candidates  of  an  optimal  stationary  policy  [4,5,1 1,12,13,15], 

In  this  paper  we  extend  the  development  and  usage  of  bounds  on  v*  to  the  generalized 
Markov  decision  setting.  Since  most  results  in  the  literature  were  developed  using  a  contraction 
argument  and  the  generalized  process  does  not  usually  possess  this  property,  we  must  utilize  a 
slightly  different  set  of  machinery.  We  will  rely  heavily  on  duality  theory  and  the  Perron- 
I  robenius  theorem  (see  Varga  [16]  or  Seneta  [14]). 

2.  NOTATION  AND  PRELIMINARY  RESULTS 

Let  v  and  \  be  two  vectors.  Write  x  >  i  (respectively,  ,v  >  v)  if  .v,  ^  (respectively, 
,v,  >  )  x,  for  every  /'.  Also  write  v  -J  i  if  x  >  s  but  x  ^  y.  Let  l.  Lv)  =  [z:;  <  .v)  and  if  T  is  a 

set.  let  /.  (  D  =  U  l.  (.v).  If  P  is  a  square  matrix,  p(P)  will  denote  the  spectral  radius  of  P  If 
1 

P  >  0  and  square  then  the  Perron-F  robenius  theorem  gives  us  that  Px  =  piP)x  for  some  .v  > 
0  and  p(P)  ^  0.  </—/'*)  1  exists  and  is  nonnegative  if  pi P)  <  1. 

from  [1,18|  we  have  that  v*  is  given  by  some  6*  €  A  where  piPf)  <  1  and  v*  = 

•  /- P * )  1  c"\  In  this  paper  we  are  interested  in  finding  v*  by  successively  iterating,#7  That 
is.  v"  =  <#'( v11)  where  v"  is  an  initial  guess  of  v*  Let  C  =  |v:v*  =  lim  ir"(v)]  be  the  set  of  all 

H  —  or 

starting  points  leading  to  v*  under  the  successive  application  of#7  C  ^  0  since  v*  €  C.  In 
both  the  discounted  Markov  decision  process  and  Veinott’s  discounted  process  C  =  R\  In 
general,  however,  (  ^  R'  [7,8],  A  useful  result  obtained  by  Koehler  [7,8]  is  that  HO  C  (’ 
so.  since  v*  6  C,  /,  (v*)  C  C  The  following  short  example  gives  a  case  where  L(v*)  =  C. 
The  problem  is: 

State  Action  P"  c," 

n  i  0  0  0 

2  0  10 

2  1  10  0 

It  is  readily  determined  that  v*  =  0,  l)  —  F  =  [\c:K  :>  0]  and#tv*)  =  j.v:.v  <  0)  where  e  is  a 
vector  of  ones.  F  or  any  starting  vector  v°  we  get  for  n  >  2 

Max  (O.vl!) 

Max  (0.  v,") 


#'(v")  = 
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where  h  =  I  and  /  =  2  if  h  is  even  and  h  =  2  and  /  =  I  if  n  is  odd.  Hence,  f£"(x")  converges 
if  and  only  if  v"  <  0,  i.e.,  v11  6  /.  (v*L 

From  a  practical  point  of  view,  it  is  easy  to  pick  a  point  of  L(v*L  For  example,  let  v"  = 
Md  where  M  «  0  and  d  >  0.  Thus,  since  Hv*)  may  be  C'  and  picking  points  of  Z.  ( v * >  is 
relatively  easy,  when  we  restrict  attention  to  the  case  where  v"  €  L(x*)  we  do  so  without  much 
practical  loss  of  generality.  In  the  previous  section  we  defined  D  =  iv:v  ;>  P,,v  4-  cs,  8  €  A)  = 
jv:v  >  if'v)).  We  wish  to  express  this  set  in  one  further  way.  Define  the  vector  /”  by 

ji  -  n  ,  =  / 

'll  \-Pf  i  ^  / 

where  /,/  €  S  and  u  €  ■<,.  Let  /  ’  be  a  matrix  having  each  ff  as  a  row  where  a  €  /),,  i  €  .V. 
Corresponding  to  /  ',  let  c  be  a  vector  of  the  c,"  values.  Then  we  can  write  D  as  D  =  (v:/-  'v  > 
c).  The  matrix  A  is  essentially  Leontief  [7!  and  since  p(P)  <  1  for  some  8  €  A,  the  set 
{x.P.x  >  0,  .v  >  0l  is  nonempty  [18l. 

3.  ELIMINATION  OF  SI  BOPTIMAL  ACTIONS 

Suppose  one  has  bounds  /  and  it  such  that  I  <  v*  <  u.  If  action  a  6  A,  is  part  of  an 
optimal  policy,  then  the  inequality  v,  >  LP"v,  +  c"  must  be  tight  at  v*.  Clearly  then,  if  the 
above  inequality  is  never  tight  in  the  polytope  B  =  |x:/  <  x  u)  it  cannot  be  tight  at  v*  and 
should  be  eliminated  from  further  consideration.  A  typical  test  for  checking  this  condition  is  if 

(1)  I P“ n,  +  <•;'  <  /, 

then  a  is  suboptimal  (see  (5,11,12,13.15)  for  such  examples). 

A  tighter  test  results  directly  from  duality  theory.  The  inequality  v,  >  I P“\,  +  if  is  not 
tight  in  B  if  and  only  if 

(2)  If  >  0  and  (I  -  PfHu,  -  lr)  <  If 
or 

If  <  0  and  I  Ifiii  -  /,)  >  If 

'i!‘ 11 

where  If  =  if  -  Iff  /,. 

Notice  that  the  test  in  (I)  can  never  eliminate  an  action  when  If  >  0  but  that  (2)  allows 
this  condition.  Anything  eliminated  by  test  (1)  is  removed  by  (2). 

4.  BOUNDS  FOR  THE  GENERALIZED  PROBLEM 

We  begin  our  development  of  bounds  on  i>*  by  considering  the  restricted  case  where  A  = 
(8)  and  p(P0  <  1.  That  is,  we  wish  to  determine  bounds  on  v*  =  (/  -  Ph)  1  c*.  Both  Por- 
leus  [12]  and,  indirectly,  Veinott  (17)  have  investigated  this  case.  Porteus  first  transforms  the 
process  into  an  equivalent  one  where  the  new  transition  matrix  Pfi  has  all  equal  row  sums 
(which  are  necessarily  less  than  1.0).  Once  this  has  been  accomplished,  bounds  such  as 
(10,12,13,15)  can  be  computed.  Here  we  do  not  transform  the  data. 

For  the  time  being,  let  us  suppress  8.  Let  d  >  0  but  otherwise  arbitrary.  Let  a  and  b 
satisfy 


< .  I  kill  III  I  K 


Xb 

( 3 )  ad  <  v"  ‘ 1  -  v"  <  />(/. 

We  wish  to  develop  bounds  of  the  form 

(4)  /  =  v"“  +  <»</  <  v*  <  v" ' ‘  +  pd  =  u 

where  A  is  a  nonnegative  integer.  Here  v”  =  v"  e  C  =  Rs  and  lim  v"  =  v*. 

Since  Pk  is  isotone,  from  (3)  we  have 

(5)  <  /JV"  1  -  Pkx"  <  bPkd. 

Multiply  (5)  by  P  to  get 

uPk ' '</  <  Pk '  'v"’ 1  -  Pk  “v"  <  bPk  "d 
and  add  this  to  (5).  We  get 

ad  +  P)Pkd  <  Pk'  'v"*1  -  Pkv"  <  bU  +  P)Pkd. 

Repeating  this  procedure  and  taking  limits  gives 

(6)  a(l  -  P)  'Pkd  <  Pkv*  -  Pk\n  <  bU  -  P)  'Pkd. 

PROPOSITION  1:  Let  v*  =  Px*  +  c  where  P  ^  0  and  p(P)  <  1.  Let  v"  +  l  =  P\n  +  c 
and  d  >  0.  a  and  b  are  such  that 

ad  <  v"*1  -  v"  <  bd 

and  k  ;>  0  and  integral,  then 

ad  +  v",A  <:  v*  <  fid  +  v"  +  A 

where 

P  >  0  ifh  =  0 
P  >  by  if  b  >  0 
P  >  by  if  b  <  0 

and 

a  <  0  if  a  =  0 
a  <  ay  if  a  >  0 
a  <  ay  if  a  <  0 

where 

y(y)  =  Max  (Min)  x'Pkd 

subject  to 

x'd  -  x'Pd  =  1 
A '(/  -  P)  ^  0. 

PROOF:  We  will  prove  the  result  for  p  and  note  that  the  proof  for «  follows  in  a  similar 
manner.  Let  v  =  bd  —  P)  1  Pkd  >  v*  —  v"tA  as  given  in  (6).  Then,  by  duality,  y  <  pci  if 
and  only  if  x'd  -  P)  >  0  implies  Px'(l  -  P)d  >  bx’Pkd.  Notice  that  x'd  —  P)  >  0  implies 
x  >  0  since  (I  -  P)  0.  Also,  since  p(P)  <  1,  using  the  Perron-Frobenius  theorem  we 
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get  that  x'U  -  P)  =  0  if  and  only  if  a  =  0  and  x'U  -  P)  ^  0  whenever  x  >  0  gives 
a  (/  -  I’)  >  0.  There  is  such  an  x  ^  0  (use  x'  =  d'U  -  P)  ').  Hence,  fl  must  satisfy 

p  -  x'U  -  /»>,/ 

whenever  x'U  —  P)  >  0  with  v  ^  0.  Fnumerating  the  cases  where  b  =  0,  b  <  0  and  b  >  0 
gives  the  results  of  the  theorem.  We  need  only  show  that  the  objective  function  of  the  linear 
programs  is  bounded  for  all  feasible  points.  Suppose  this  is  not  true.  Then  there  is  a  ru"  z  ^  0 
such  that  z'd  -  :'P<!  =  0  or  :’U  -  P)d  -  0.  Since  d  >  0  and  z'U  -  P)  >  0,  z'U  -  P)  =  0. 
This  gives  that  r  =  0,  a  contradiction. 

Some  useful  cases  follow. 

COROLLARY  I:  When  A  =  0, 


and  when  A  = 


-  q  r 


qir)  =  Max  (Min)  x' Pd 


x'd  =  1 

x'U  -  P)  >  0. 


Note  that  q  and  rare  both  strictly  less  than  1.0. 

COROLLARY  2:  If  d  is  an  eigenvector  of  P  with  Pd  =  Kd,  then 
\A 

y  =  r- 


Most  of  the  bounds  reported  for  discounted  Markov  decisions  fall  into  one  of  the  two 
cases  given  above.  Usually  d  is  a  vector  of  ones. 

While  determining  y  or  y_  is,  in  general,  a  nontrivial  task,  one  can  usually  obtain  useful 
bounds  on  y  and  y  and  use  these.  For  example,  the  Perron-Frobenius  eigenvector  is  a  feasible 
solution  so 

Also,  as  is  commonly  known, 

(Pd),  (Pd), 

Min  — —  <  p(P)  <  Max  - — ; — . 

i  (l,  i  u, 

The  dual  problems  also  provide  bounds  although  one  must  obtain  tight  enough  upper  bounds  to 
be  meaningful. 


We  now  return  our  attention  to  determining  bounds  for  the  generalized  Markov  decision 
process.  In  the  following  we  assume  v"  €  L  (v*)  so  that 
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and 

(7)  =  Max  v,''“) 

provides  us  with  u  lower  bound  to  v*  at  each  iteration.  An  upper  bound  is  not  as  easy  to 
derive. 

In  the  unlikely  event  that  8*  is  known,  one  can  use  the  upper  bound  developed  in  Propo¬ 
sition  I  since 

jP*.(v',  +  I  -  V")  <:  bP^.d 

plus 

Pkh:'(v"^  -  v")  <  bPf, t 1  d 

gives 

Pl&.W")  -  P^.v"  <  bd  +  P*.)Pk.d 

or,  in  the  limit, 

P£.v*  -  A^.v"  <  b(l  -  Ph.)-'P^d. 

That  is. 

v*  -^‘.(v")  <  bil  -  Pt.r'Pfi.d. 

The  resulting  bound  is 

(8)  v*  <y;.(v")  +  fid  <  v"+A  +  (id 

where  fi  is  given  in  Proposition  1  and  y  and  y  correspond  to  /V  For  k  =  0  we  get  v*  ^  v"  + 
/3r/and  for  k  =  1  we  get  v*  <  y,.(v")  +  (id  <  v"+l  +  (id. 

We  realize,  of  course,  that  if  8  *  is  known,  one  would  ignore  ail  other  8  €  A  and  work 
only  with  8*.  A  more  reasonable  case  is  if  8  *  is  unknown  but  ys<  is  known.  Since  v°  <  v*, 
v"  ’ 1  ^  v"  for  all  n  unless  v"  =  v  *.  Hence,  v'1  ‘ 1  -  v"  ^  bd  implies  b  >  0.  Thus,  knowledge  of 
is  sufficient  for  determining  an  upper  bound  on  v*. 

As  an  illustration  of  (8)  and  the  elimination  procedure  of  (2)  consider  the  following 
example: 

Example  1  State  Action  Pf‘  cf 

1  \  0  2  T 

2  0  13 

2  1  20-6 

2  10-3 

3  0  0  -1 

N ole  that  no  Ph  has  all  its  rows  less  than  one.  Here 


p(P*.)  =  0 
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d  =  ({)  n-  ”  I  for  k  -  1. 

Let  v°  =  -lOe.  Then 


so 


Let  b  =  9  and  p  =  9.  Then. 


<  v* 


2 

8  ' 


Using  the  elimination  procedure  of  (2)  we  get 


State 

Action 

c" 

Test  Value 

1 

1 

7 

9 

2 

9 

9 

2 

1 

-19 

-18 

2 

-9 

-9 

3 

0 

0 

Eliminate 


Suppose  neither  8  *  nor  is  known.  Consider  the  following.  Let  4'C  4  such  that  8*6 

A’,  8  6  A’  implies  p(P/>)  <  1  and  if  8 (/)  =  8(/)  for  some  8  6  A’  for  each  i  -  1 . s  then 

8  €  A’.  A  special  case  is  A’ =  |8*).  After  appropriate  permutations  we  could  write  F  = 
(F |.  fj)  where  F \  corresponds  to  A'.  The  matrix  F\  is  totally  Leontief  and  has  several  desir¬ 
able  features,  one  of  which  is  that  the  set  | x:F\x  =  0,  *  >  0)  is  empty  [18],  LeX  F\  =  B  -  Q 
where  each  column  of  Q  looks  like  e,  -  ff  where  a  =  8(/)  for  some  8  6  A'.  B  then  has  unit 
vector  columns  and  each  row  has  at  least  one  + 1 . 

In  a  manner  analogous  to  the  procedures  leading  to  (6)  and  (8)  we  can  determine  condi¬ 
tions  on  p  such  that 

(9)  bPtd  <  </  —  Pt)u 

u  <  fid 

for  all  8  €  A'  and  thus  obtain  an  upper  bound  to  ys..  System  (9)  can  be  written  as 
bd  <  F\U  k  =  0 
u  ^  fid 
and 

bQ'd  <  F\u  k  =  1 
u  ^  P  d. 

The  following  result  follows: 

PROPOSITION  2, 

Let  d  >  0  and  b  satisfy 
v"+l  -  v"  <  bd 
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0() 


where  v"  €  /.  (v*)  and  v"  =  (v").  Let  l\  be  constructed  as  given  above.  Then 

V*  <  V"‘A  +  (id 

if  A  =  0  and 


/A  >  b  Max  d'x 


s.t. 


</T,.v  =  1 
/ ,  v  ^  0 
v  >  0 

or  if  A  =  1  and 


(i  >  b  Max  d'Qx 


s.t. 

d'h\x  =  1 
t\x  >  0 
.v  £  0 

PROOF:  Let  g  -  bd  if  A  =  0  and  g  -  bQ'd  if  A  =  1.  Then  g  ^  F[u,  u  <  (id  has  a  solu¬ 
tion  if  and  only  if  .v  >  0,  F\x  ^  0  implies  (3d'F\x  >  g'x.  The  rest  follows  as  in  Proposition  1 
except  here  we  note  that  the  constraint  set  is  bounded  since  \x.F\X  =  0,  x  >  0)  is  empty. 

The  dual  linear  programs  provide  upper  bounds  to  the  solutions  of  the  problems  in  Proposition 
2  and  these  in  turn  are  upper  bounds  to  yh-.  The  bounds  of  Proposition  2  are  used  as 

v*  <  \'"k  +  (id. 

The  final  case  we  consider  is  when  no  S'  can  be  determined  due,  perhaps,  to  the  necessity 
of  knowing  that  S*  €  A'.  In  such  a  case  one  is  faced  with  the  unpleasant  task  of  determining  a 
y6  for  each  b  €  A  where  p(Pf,)  <  1  and  then  using  the  largest  such  value  in  determining  /3. 
This  would  involve  solving 

(10)  Max  x'P^d 


s.t. 

x'd  -  x'Pf.d  =  I 
x’U  -  P)  >  0 
,v  >  0 

for  each  S  €  A.  Unbounded  or  infeasible  problems  can  be  ignored.  While  this  procedure 
would  be  a  considerable  task,  if  a  decision  problem  is  to  be  solved  a  large  number  of  times  with 
only  the  c,"  elements  changing,  then  it  may  be  of  value  to  determine  a  bound  for  (i  in  this 
fashion. 

As  an  example,  the  optimal  solution  values  to  ( 10)  for  each  S  €  A  of  the  problem  in 
Fxample  1  are: 


\ 
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8 

Value  (k  —  1 ) 

(1,1) 

No  Solution 

(1,2) 

No  Solution 

(1,3) 

2 

(2,1) 

No  Solution 

(2,2) 

No  Solution 

(2,3) 

1 

Thus,  without  knowledge  of  8*  one  would  have  to  use  fi  >  2b.  Note  also  that  A'  = 
and  the  procedure  of  Proposition  2  would  have  led  to  fi  >  2b  also. 


As  a  final  note,  it  is  not  always  possible  to  abstract  a  A'  C  A  containing  all  8  6  A  having 
pi.Pt,  I  <  1  with  no  8  €  A'  having  piP*)  i>  I.  For  example. 


State  Action 


/>" 
'  u 


we  find  that  piPD  <  1  only  for  8  € 


1 

1 

2 

2 

1 

2 

(1)  (1 

1  l2l 

ll]’  12 

1’  bl 

0  0 
0  1 
1  0 
0  0 

This  set  does  not  qualify  for  a  A'  set  since 


8  =  [2J  has  piPf,)  -  1  yet  8(1)  and  8(2)  are  represented  in  the  set.  Hence,  one  may  have  to 
use  (10)  instead  of  Proposition  2. 
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Rcccnii  research  Ims  led  in  sever, il  surrogate  multiplier  search  pmcedures 
lor  use  m  a  primal  branch  and  bound  procedure  Ns  single  constrained  integer 
programming  problems,  ihe  surrogate  siibpmblems  are  also  solved  via  branch- 
and  bound  Ibis  paper  develops  ihe  mner  plav  between  ihe  surrog.de  subprob 
leni  and  ihe  primal  branch  ami  bound  trees  which  *. a n  be  exploded  to  produce 
a  number  ol  compuiaiiona!  clViaenues  Most  important  is  a  restarting  pro¬ 
cedure  vvhuh  precludes  the  need  n»  solve  numerous  surrogate  suhprnblems  ai 
each  mule  ol  a  primal  branch  and  bound  nee  I  mpincul  evidence  suggests  ifi.it 
tins  procedure  ereallv  reduces  total  compulation  lime 


I.  INTRODUCTION 

Consider  the  general  integer  linear  programming  problem: 

(P)  Min  ex  subject  to  ,-l.v  Sj  b 

\  ‘  \ 

where  S  =  |.v  0:  (ix  ^  h.  v  satisfies  some  discrete  constraints'.  Here.  1  and  0  are  m  x  n 

and  <j  v  n  matrices  respectively,  with  all  vectors  having  the  appropriate  dimension. 

The  surrogate  relaxation  of  the  problem  (P>  associated  with  any  v  $  0  is 

IP')  Min  ex  subject  to  vt.Tv— />)  C  0. 

>  ■  \ 


If  we  define  the  function 

i'M  =The  value  of  an  optimal  solution  to  problem  (■)  if  one  exists 
and  -t-°°  if  the  problem  is  infeasible 

')} 
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then  clearly  ( P' '  provides  a  lower  hound  on  HP)  for  any  v  ^  0.  The  best  such  bound  is 
achieved  by  the  surronaic  dual. 

Max  (r  <  PM ) 


Only  in  rare  integer  programs  would  one  expect  such  a  dual  problem  to  directly  produce  a 
solution  to  (P).  Thus,  the  importance  of  duals  in  integer  programming  centers  on  their  ability 
to  produce  bounds  for  a  branch-and-bound  procedure.  By  careful  partitioning  of  the  constraints 
of  a  problem  into  those  which  are  relaxed  Ax  ^  b,  and  those  which  are  enforced  x  £  S,  prob¬ 
lems.  (PM.  can  be  created  which  are  easier  to  solve  than  (P).  Thus  the  bound  HPM  is  easier 
to  obtain,  and  searches  over  v  >  0  will  produce  improved  bounds.  The  successful  application 
of  duality  in  a  branch-and-bound  scheme  can  be  seen  to  depend  on  the  quality  of  these  bounds 
and  the  ease  of  computing  the  bounds,  since  one  must  repeat  the  procedure  over  and  over  with 
different  candidate  sets. 

Recent  research  (see  Karwan  and  Rardin  |6|)  has  produced  a  number  of  surrogate  multi¬ 
plier  search  procedures.  Empirical  results  ( 5 1  suggest  surrogate  duals  may  close  a  significant 
fraction  of  the  gap  between  the  values  of  the  lagrangian  dual  and  the  primal  problems. 

In  this  paper,  the  intent  is  to  more  fully  develop  the  inner  play  between  the  surrogate  dual 
and  the  primal  in  a  branch-and-bound  procedure.  When  the  two  are  considered  conjunctively  a 
number  of  advantages  are  gained  beyond  the  providing  of  a  bound  by  the  surrogate  dual  A 
number  of  general  observations  will  first  be  made  with  respect  to  the  surrogate  dual.  Then 
specific  issues  or  parts  of  the  general  branch-and-bound  procedure  will  be  developed  in  their 
relationship  with  the  surrogate  dual. 


2.  SURROGATE  SI  BPROBI.EMS 


Consider  the  surrogate  relaxation  of  ( P )  for  any  v  >  0.  Note  that  (P M  is  itself  an  integer 
linear  programming  problem  with  a  single  main  constraint  v(Ax-b)  ^  0.  Thus,  it  is  a  knap¬ 
sack  problem  with  a  set  of  side  constraints,  .v  €  S.  A  number  of  solution  techniques  have 
appeared  in  the  literature  for  the  case  of  V  =  |.v:  ,v  ^  0.  v  bounded  above).  Basically  these  can 
be  divided  into  two  categories,  dynamic  programming  procedures  and  branch-and-bound  or 
implicit  enumeration  procedures.  For  a  good  review  of  the  dynamic  programming  procedures, 
see  Garftnkel  and  Nemhauser  [3|.  It  will  soon  become  evident  that  a  branch-and-bound  pro¬ 
cedure  will  be  more  convenient  in  solving  (PM,  because  the  relation  between  the  primal  and 
knapsack  branch-and-bounds  can  be  exploited.  Moreover,  Cabot  [I],  Kolesar  [7] ,  Fayard  and 
Plateau  [21,  and  Greenberg  and  Megerich  (4|,  among  others,  have  developed  branch-and-bound 
procedures  which  proved  computationally  more  efficient  than  the  dynamic  programming 
approaches.  Finally,  Karwan  and  Rardin  [6i  have  shown  that  each  surrogate  relaxation  need 
not  be  solved  optimally.  Only  a  feasible  solution  with  value  less  than  or  equal  to  the  incumbent 
solution  value  of  the  surrogate  dual  is  necessary  for  terminating  the  solution  of  (PM.  By  solv¬ 
ing  (PM  via  a  branch-and-bound  procedure  such  solutions  are  readily  available,  require  no  extra 
computations,  and  lead  to  fewer  iterations  (choices  of  v)  in  solving  (/>s).  in  a  dynamic  pro¬ 
gramming  procedure,  however,  a  feasible  solution  is  generally  not  available  until  optimality  is 
obtained  so  that  (PM  must  be  solved  completely.  For  these  reasons,  and  more  to  become 
apparent  upon  seeing  the  inner  play  with  (P),  the  remainder  of  this  paper  assumes  surrogate 
relaxation  subproblems  are  best  solved  via  a  branch-and-bound  procedure. 
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Role  of  the  Primal  Incumbent  in  IP'IT)) 

In  branch-and-bound  procedure,  the  set  of  feasible  solutions  to  IP)  is  partitioned  into 
independent  subsets  by  an  enumeration  which  places  additional  constraints  on  integer  variables. 
The  unenumerated  portion  of  IP)  is  represented  by  a  list  of  candidate  problems,  each  of  which 
is  simply  </’)  with  certain  additional  constraints  v  €  T  appended.  To  facilitate  the  discussion, 
we  define  P (  7  )  to  be  the  same  as  (/’)  except  that  v  is  restricted  to  v  €  T  We  also  define 
r  'IP)  to  be  the  value  of  the  best  currently  known  feasible  solution  to  IP).  i.e.  the  value  of  the 
incumbent  solution  used  to  provide  an  upper  bound  on  the  optimal  solution  value. 

Note  that  el/)sl'D)  is  being  employed  us  a  bound  for  some  candidate  problem  PIT)  in 
the  primal  branch-and-bound  procedure.  However.  r  IP'IT))  is  a  valid  bound  in  PIT)  for  all 
v  0.  not  just  the  v  which  maximizes  rlP'IT)).  Thus,  IDi.IT))  need  not  be  solved  optimally 
iff  ( P '  (  71).  for  some  v  used  on  the  way  to  solving  ( /)s(  /  )).  is  sufficient  to  fathom  PIT),  i.e.. 
rIP'l  FI)  ^  r  'IP) 

Conversely,  the  value  of  the  incumbent  in  the  primal,  e*IP).  may  be  used  as  an  upper 
hound  in  solving  any  IP').  That  is.  if  no  completion  of  a  candidate  problem  in  IP')  can  pro¬ 
duce  a  solution  with  value  less  than  i*IP).  that  candidate  problem  in  IP')  may  be  fathomed. 
If  all  candidate  problems  in  the  knapsack  IP'I  T))  fail  to  produce  a  solution  with  value  less  than 
r  'IP).  then  it  can  be  concluded  that  elP'ITI)  ^  r  *IP)  so  that  the  candidate  problem  PIT) 
may  be  fathomed  in  the  primal. 

-V  CONDITIONAL  BOUNDS  AND  BRANCHING  VARIABI.KS 

The  rationale  for  the  interaction  between  the  two  branch-and-bound  procedures  with 
respect  to  conditional  bounds  and  branching  rules  can  perhaps  best  be  understood  via  a  0-1 
integer  programming  example.  Later  a  procedure  for  the  general  case  will  be  presented.  Con¬ 
sider  figure  1  which  presents  a  branch-and-bound  tree  for  the  problem  IP  'll'))  where  PH)  is 
a  given  candidate  problem  from  the  primal  tree.  This  tree  may  result  from  the  application  of 
any  branch-and-bound  procedure  for  solving  IP'IT))  The  solution  is  found  at  node  8  with 

value  i'*.  Since  the  full  tree  is  shown  and  an  optimal  solution  has  been  found.  *•' . i>: . r 

must  all  be  ^  n8. 

Now  a  number  of  important  observations  may  be  made.  If  is  .accepted  as  the  optimal 
solution  value  for  IDi.IT))  and  the  candidate  problem  PIT)  is  not  able  to  be  fathomed  (i< 8  < 
r  'IP))  then  a  branching  variable  must  be  chosen  and  a  conditional  bound  computed  for  each 
of  the  two  new  nodes  created  in  the  primal  tree.  Note  that  if  \ ,  is  chosen  as  the  branching 
variable,  then  a  valid  bound  on  any  solution  to  IPITO  i.v:  v,  =  l))l  is  given  by 
i’  =  Minte  .i'‘)  Also,  since  v*  was  the  optimal  value  of  IP'IT)),  e  ^  r\  So  even  though 
i’*  <  e*IP),  it  is  possible  that  the  bound  r  ^r'lP)  so  that  no  completion  of 
PI  TC i  (v:  V|  =  If)  will  ever  need  be  considered.  It  follows  that  v,  is  a  good  candidate  for  a 
branching  variable  in  the  primal  tree.  Note  that  a  conditional  bound  for  branching  on  v,  may 
be  taken  as  Min  (r'.r  ,i>')  for  v,  =  0  and  Min  (r\i'\r:)  for  v,=  1  One  problem  is  that  all 
of  the  end  nodes  for  which  ,v>  is  a  free  variable  must  be  included  (hence  r')  in  calculating  both 
bounds  V|  is  the  only  variable  for  which  no  free  end  nodes  may  exist,  and  we  will  choose  it  as 
the  branching  variable 

What  is  required  to  implement  the  branching  procedure  suggested  above  is  the  saving  of 
the  minimum  value  or  bound  on  the  end  nodes  for  each  of  the  two  sides  of  the  tree  defined  bv 
the  lirst  branching  variable.  An  end  node  may  be  recognized  as  one  from  which  a  fathoming 
occur-  Thus,  before  fathoming  it  is  necessary  to  determine  which  side  of  the  tree  one  is  mi 
check  to  see  if  the  bound  on  that  node  is  less  than  the  saved  bound  for  that  side  of  the  lie. 


M  il  K  ARW  AN  AM)  K  I  RARDIN 


% 

and  if  necessary,  replace  that  saved  bound.  Then  alter  solving  (/*' (’/’))  one  will  have  i< (/’'</')) 
as  the  bound  on  one  side  (rK  in  Figure  I).  and  a  bound  saved  for  branching  on  the  nonoptimal 
side  of  the  tree  (Min(i'\i'-'>  in  Figure  1) 


S  C  {  x:0  <  x  <  1,  x  integer} 

I  Mil  hi  I  ! sample  ol  .1  Hr. inch  .uul  Umiiut  live  lm  i/'  i  /  11 


4.  INTERACTION  OF  THE  SURROGATE  SEARCH  MASTER  PROBLEMS 

The  two  surrogate  dual  algorithms  which  appear  most  promising  as  discussed  in  Karwan 
and  Rardin  [6 1  both  keep  a  list  of  the  .v’s  generated  by  each  surrogate  relaxation  and  solve  a 
master  problem  involving  these  .v’s  to  obtain  a  new  surrogate  multiplier  v.  These  master  prob¬ 
lems,  one  for  each  candidate  problem  in  a  primal  branch-and-bound  procedure,  may  be  seen  to 
interact  in  such  a  way  as  to  save  a  great  deal  of  time  in  solving  (£>s)  at  any  proceeding  node  in 
a  primal  tree. 

Consider  the  primal  branch-and-bound  tree  shown  in  Figure  2  for  a  0-1  integer  linear  pro¬ 
gramming  problem.  Assume  that  a  master  problem,  or  at  least  a  list  of  the  v’  generated  in 
solving  ( [)s(<b ))  at  node  0,  has  been  kept  and  il  is  now  time  to  branch  on  v,.  Scan  the  master 

problem  at  node  0  and  place  all  .v',  (=1,2 . A  which  satisfy  x\  =  0  in  a  new  master  problem 

for  solving  (DS(T))  at  node  1  of  the  primal  tree.  All  solutions  ,v  6  S,  .v,  =  1  such  that 
ix  <  i '( 'i  have  been  made  infeasible  by  the  optimal  surrogate  multiplier  at  node  0  If 
one  is  to  ir  i  i’  ( ) )  as  a  bound  after  branching  on  ,V|,  then  all  of  these  .v’s  must  be 

included  in  i  v  master  problem  at  node  1.  This  is  valid  since  the  candidate  problem  at 

node  1  is  a  mo  onstrained  version  of  (/■*),  and  all  the  .v’s  put  in  the  master  problem  satisfy 
this  extra  constraint 


•v 
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This  procedure  may  be  continued  as  follows  In  solving  I/-M  /’>•  at  node  I,  possibly  more 
v's  are  generated  When  branching  to  node  2,  all  Vs  in  the  master  problem  at  node  I  with  v4 
=  0  may  be  pul  in  the  master  problem  to  begin  solving  the  surrogate  dual  at  node  2. 

Any  candidate  problem  may  be  chosen  to  be  explored  next  in  a  branch-and-bound  pro¬ 
cedure  and  a  number  of  strategies  have  been  suggested.  The  " last-in  first-out"  or  LIFO  pto- 
cedure  always  chooses  the  most  recently  added  member  of  the  candidate  list  to  explore.  Refer¬ 
ring  to  Figure  2.  the  nodes  have  been  numbered  in  the  order  in  which  a  I. If  f)  procedure  might 
explore  them.  Hence,  the  order  of  branching  is  from  node  0  to  node  I  to  node  2  and  to  node  2 
at  which  time  node  2  is  fathomed,  either  because  the  incumbent  solution  to  </')  was  exceeded, 
a  feasible  solution  was  obtained,  or  it  was  determined  that  \|  =  0,  v4  =  0  and  v,  =  I  precluded 
any  feasible  solution  to  I/O.  Thus  "back-tracking”  goes  to  node  4  which  is  also  fathomed,  lead¬ 
ing  back  to  node  5.  In  a  LIFO  procedure  note  that  there  are  never  more  than  two  nodes  at  any- 
given  level  of  the  tree,  a  level  being  defined  by  the  number  of  fixed  variables  or  extra  con¬ 
straints  on  I/O.  For  instance  in  Figure  2,  the  fathoming  of  nodes  2  and  5  must  occur  before 
node  6  is  chosen  as  the  node  from  which  to  branch.  In  large  integer  programming  problems, 
where  many  Vs  from  previous  surrogate  master  problems  are  to  be  stored,  storage  can  be  a 
main  concern  and  it  is  minimized  by  using  the  LIFO  branching  procedure. 


1 1* ■(  m  ?  I  sample  <>l  a  Primal  liranch-and-Houiul  lice 


The  master  problem  interactions  can  be  shown  to  be  very  efficient  in  terms  of  a  LIFO 
branching  procedure  lor  (/').  Again  consider  Figure  2  and  the  following  use  of  a  "current 
table  and  a  save  table "  At  node  0,  the  master  problem  consists  of  the  following  v's,  say  for  n 
-  dimension  of  v  =  5. 
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v1  0  0  111 
v'  10101 
v !  01100 

v4  11010 

Branching  takes  place  to  node  I.  Those  v's  which  have  a,  =  0  (a1  and  a')  are  placed  in  the 
"current  table"  for  the  "current"  or  next-to-be-cxplored  candidate  problem.  The  other  v’s  (at 
and  v4)  are  placed  in  the  "save  table"  and  it  is  noted  that  at  Idvel  1  of  the  tree,  the  next  open 
slot  in  the  save  table  is  in  row  3.  Node  I  is  now  explored  and  some  new  .v’s  are  generated  and 
put  in  the  current  table  which  becomes 

v1  0  0  1  I  I 
a  1  0  I  I  0  0 

a  '  00101 

V  0  10  11 

Now  it  is  time  to  branch  to  node  2,  so  those  .v's  which  have  x4  =  0  remain  in  the  current  table, 
i.e..  x'  and  x\  v1  and  ,vh  arc  placed  in  the  save  table  and  it  is  noted  that  the  next  open  slot  in 
the  save  table  at  level  2  of  the  tree  is  5. 

The  current  table  is  now 
v  :  0  I  I  0  0 

a'  ('  0  10  1 
and  the  save  table  is 

a-  I  0  I  0  I  !  =  , 
a4  11010 


.v1  0  0  111  ,  , 

.v'1  0  I  0  1  I 

Assume  that,  in  contrast  to  figure  2,  fathoming  occurs  at  node  2.  possibly  after  generating 
some  more  v’s.  Now  the  current  table  can  be  cleared  since  it  is  no  longer  necessary  to  explore 
any  candidate  problem  with  ,V|  =0  and  x4  =  0.  In  fact,  these  .v’s  will  never  be  generated  or 
needed  again,  since  either  x,  or  x4  or  both  will  always  be  fixed  at  I  in  any  future  candidate 
problems.  Now  the  LIFO  branching  procedure  goes  to  node  5  with  v,  =  0  and  ,v4  =  I  But 
some  of  these  v’s  are  stored  in  the  save  table  from  the  last  slot  in  the  save  table  (5—1  =  4) 
back  to  the  next  available  slot  stored  after  the  previous  level,  level  I,  which  is  slot  number  3. 
These  are  put  in  the  current  table  which  is  now 

v1  0  0  111 

v1’  0  1  0  I  I 

and  the  save  table  is  now 
x‘  10101 
a4  I  I  0  I  0 

Possibly  more  v’s  are  generated  at  node  5  and  placed  in  the  current  table.  A  fathoming  then 
occurs  at  node  5  and  a  "backtracking"  takes  place  to  node  6.  The  "other  side"  of  level  2  has 
been  explored  so  the  backtracking  must  be  to  level  1.  The  current  table  is  again  cleared  and 
the  elements  in  the  save  table  from  the  last  slot  to  the  first  slot  for  level  I  savings  (slot  1)  are 
placed  in  the  current  table  The  procedure  continues,  with  only  two  lists  being  necessary  to 
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easily  store,  update,  and  use  all  of  the  v's  generated  by  solving  surrogate  relaxations  throughout 
the  primal  branch-and-bound  procedure.  Note  that  no  v's  will  be  regenerated  using  this  pro¬ 
cedure,  and  again  that  once  the  current  candidate  problem  is  fathomed  those  v’s  may  be  taken 
out  of  storage 

The  following  is  a  formal  outline  for  branching  and  fathoming  while  employing  the 
current  and  save  tables  in  a  LIFO  branching  procedure  for  a  general  integer  linear  programming 
problem  Let 


L 

7V,  ,.77,, 

SAVBNI)  ( L ) 

NXSV 
NXCR 
NSV ( L ) 

r  *1/0 


“current  level  in  primal  branch-and-bound  tree 
=  two  new  candidate  problems  created  at  level  /.,  77/  ,  is 
candidate  problem  chosen  to  explore  next 
“bound  saved  for  candidate  problem  at  level  I.  which 
is  not  being  explored  next 
“next  available  slot  of  the  save  table 
=  ncxt  available  slot  of  the  current  table 
=  next  available  slot  of  the  save  table  at  level  /. 

in  the  primal  tree 
“incumbent  solution  value  to  (P) 


Branching: 

If  SAVBNI)  (/. )  <  r  *(P).  place  all  .v’s  from  the  current  table 
satisfying  v  €  T;,  ,  into  the  save  table,  updating  NXSV.  In 
any  case,  let  NSV  (/. )  =  NXSV  and  remove  all  .v's  satisfying 
.v  €  T, 7  ,  from  the  current  table,  closing  up  the  current 
table  and  updating  NXCR.  Determining  if  v  €  77/,  is  done 
simply  by  checking  the  single  component  of  v  upon  which  the 
branching  occurred. 

Fathoming: 

(Tear  the  current  table  by  setting  NXCR  =1.  (If  77/  >  has 
already  been  explored,  SAVBNIXZ. )  =  +  <».)  If  SAVBNDf/. )  ^  v 
replace  NXSV  by  NSV ( /.  —  1 )  and  L  by  7.-1  until  a  candidate 
problem  is  found  to  explore.  Place  rows  NSV  ( L  —  1 1  to  NSV ( /.  1-1 
from  the  save  table  into  the  current  table.  Update  NXCR 


'(P) 


After  branching  or  fathoming  more  .v's  are  generated  while  solving  ( /)s  (  77)  and  placed  in 
the  current  table  until  it  is  time  to  branch  or  fathom  again. 

Although  formally  developed  here  for  a  LIFO  branching  procedure,  the  current  and  save 
table  concept  can  be  used  for  any  primal  branching  procedure  (e  g.,  least  lower  bound)  by  scan¬ 
ning  a  single  save  table  for  .v’s  which  satisfy  the  constraints  on  the  present  candidate  problem 
As  seen  above,  this  scanning  is  done  very  efficiently  in  the  LIFO  procedure  by  simply  keeping 
an  indicator  (NSV(P))  for  each  level  (L)  of  the  primal  tree.  In  any  case,  when  a  candidate 
problem  is  fathomed,  the  appropriate  v’s  may  be  taken  out  of  storage  and  will  never  be  needed 
or  regenerated  again. 


5.  COM  PUT  ATION  Al,  ANALYSIS 


A  set  of  randomly  generated  0-1  integer  programming  test  problems  (see  Karwan  |5|>  was 
used  to  demonstrate  the  developments  discussed  in  this  paper  A  LIFT)  branching  procedure 
was  employed  in  the  primal  branch-and-bound  tree  and  the  LRMP  procedure,  (see  Karwan  and 
Rardin  [6 1 )  was  the  surrogate  dual  multiplier  search  procedure  employed. 
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lable  1  presents  the  results  of  employing  the  above  techniques  on  three  problem  sizes 
with  a  low  and  a  high  density  and  live  replications  per  cell.  One  of  the  principal  causes  for 
interest  in  surrogate  duals  is  improvement  in  bounds.  The  percent  of  the  LP  to  IP  gap  closed 
b>  (he  surrogate  dual.  i.e.. 

(«  <  /)s  )  -  l'(/-P))/(l'(P)  -  !■  (/./')) 

appears  substantial.  The  large  range  for  a  given  cell  is  perhaps  to  be  expected  with  such 
unstructured  randomly  generated  problems. 

Some  measure  ot  the  efficiency  of  the  interaction  between  the  primal  and  the  subproblem 
branch-and-hound  procedures  is  provided  by  the  remaining  columns  of  Table  I.  As  expected, 
the  principal  part  of  all  time  spent  on  candidate  problems  is  consumed  in  knapsack  subprob¬ 
lems  Values  m  column  8  range  from  7l%-82%.  However,  the  number  of  knapsack  subprob¬ 
lems  solved  at  anv  particular  node  is  quite  small  (column  6).  The  small  numbers  are  a  conse¬ 
quence  of  the  save  table— current  table  scheme  developed  in  Section  4.  Another  indication  of 
the  efficiency  of  the  save  table  approach  is  the  relation  between  the  mean  time  to  solve  the  first 
surrogate  dual  (column  4)  and  the  mean  time  to  solve  all  surrogate  duals  (column  7).  For 
larger  problem  sizes  the  average  surrogate  dual  — which  begins  with  many  .v*  saved  from  previ¬ 
ous  knapsacks  — solves  in  5-10%  of  the  time  for  the  first  dual. 


TABLE  I.  Primal  Branch -and- Bound  Empirical  Results 
(five  Replications  per  cell) 
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EXTREME  SOLUTIONS  OF  THE  TWO 
MACHINE  FLOW-SHOP  PROBLEM 
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School  of  Business  Administration 
l  nm  rsitv  of  Wisconsin -Milwaukee 
Milwaukee.  Wisconsin 


ABSTRACT 

I  he  paper  provides  a  new  theoretical  framework  to  identity  extreme  solu¬ 
tions  of  the  t\A o  machine  How -shop  problem  Some  remarkable  properties  of 
these  solutions  have  been  developed  As  a  result  the  problem  of  generating 
minimal  solutions  can  be  decomposed  into  a  number  of  smaller  subproblems 


1.  INTRODUCTION 


The  well  known  iwo  machine  flow-shop  problem  can  be  formalized  as  follows  f  ind  a 
permutation  P  =  p\.  p: . p„  of  numbers  S.  2 . n  that  minimizes 


(1) 


UP)  = 


max 

s  u  % 


£  -4/>,  +  X  Bp, 

r  *  I  r—u 


where  A,.  Br,  r  =  1,2 . n  are  given  positive  numbers.  According  to  the  flow-shop  termi¬ 
nology  TiP)  is  the  completion  time  of  n  items  processed  in  a  sequence  P\.  p, . p„  while 

A,  and  B,  are  operation  times  of  item  r  on  machines  A  and  B.  Each  item  is  to  be  processed 
first  on  A ,  then  on  B. 


Let  /  =  ( 1 .  2 . n )  be  the  set  of  all  items,  and  /'  and  j  two  arbitrary  items  of  1.  Intro¬ 

duce  the  following  relation 

(2)  R(iJ)‘=  lmin(A,,B,)  <  min(A,,B,)]. 

Notice  that  R  ( i,j )  or  R  (j,i)  holds  for  every  pair  i.j  €  /.  We  say  that  P  =  p , ,  p: . p„  is  an 

R  —sequence  if 

(3)  i  <  j  Rip,.  Pj).  V/,  j  €  /. 

As  shown  in  Section  2  every  /^-sequence  minimizes  (1).  The  set  of  /^-sequences  is  usu¬ 
ally  a  small  portion  of  the  set  of  all  minimal  solutions. 

This  paper  examines  the  properties  of  extreme  (minimal,  maximal)  solutions  of  the  flow- 
shop  problem  It  provides  necessary  and  sufficient  minimality  conditions  (Section  3)  simpler 
than  those  of  [41  along  with  sufficient  maximally  conditions  (Section  6).  It  introduces  a  critical 
item  concept  (Section  4)  that  leads  to  several  remarkable  properties  of  extreme  solutions.  As  a 
result  the  problem  of  generating  minimal  solutions  can  be  decomposed  into  several  smaller 
several  smaller  subproblems  (Section  5) 
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2.  //-SEQUENCES 

Lei  II  he  the  set  of  all  ^-element  permutations  of'  1,2 . //.  We  will  use  symbols 

P.  Q.  P,  Q,  Pk  to  indicate  those  permutations.  Let  Q  =  <r  i/n ,  and  Q  =  it  jin  be  two  ele¬ 
ments  of  II 

LI  MM  A  I  (21  [/(CM  $  /'((/)!.  V.rt/TT.  it  jin  €  II 


PROOF  Due  tod) 


7  (CD  =  ntaxj  £  A,  +  Tin).  £  A,  +  /'(//)  +  £  B. .  Ti<r)  +  £  //,|‘ 
TiQ)  =  maxjX,  .4,  +  Tin).  £  .4,  +  7  (.//  >  +  £  P,.  T(«r)  +  £  P,j. 


Consider  inequality 
(4)  ] T < // >  7  (.// ) 


which  is  equivalent  to  (2).*  The  theorem  holds  since  (4)  implies  TiQ)  $  7" ( C7  *• 

Let  P  -  />,.  /5_. . /?„  be  an  //-sequence 

THEOREM  I:  P  minimizes  ( I ). 

PROOF:  Consider  an  arbitrary  sequence  P  €  11.  P  ^  P  Then  P  =  irp,p,n  for  some 
/  <  /.  According  to  (3)  and  Lemma  1,  Tiirp.p^)  ^  HP').  Hence,  P'  along  with  every  per¬ 
mutation  other  than  P  can  be  eliminated  from  II  as  nonoptimal.  The  well  known  Johnson’s 

Algorithm  (21  of  constructing  sequence  P  =  pt.p; . p„  can  be  defined  in  the  following 

manner 

STEP  I  Find  min  (min  A,,  min  H, ). 


STEP  2 

(a)  If  the  minimum  is  at  A:/  <  B,  define  J  =  (/» t }  where  />,  is  the  element  with 
the  smallest  subscript  among  the  elements  of  the  set  {r\A,  = 

(b)  If  the  minimum  is  at  B,  <  A,  define  J  =  \p„)  where  p„  is  the  element  with 
the  largest  subscript  among  the  elements  of  the  set  \r\B,  =  Pj 

STEP  3:  Replace  /  by  /  J  and  repeat  Steps  I  and  2  until  all  elements  of  Pare  deter¬ 
mined 

COROLLARY  I  Johnson's  Algorithm  produces  an  //-sequence. 


*Tiir ).  / ).  Hit  l.  /  (jii  are  defined  by  <1 )  lor  sequences  n.  »r  //.  //  Hence, 

rtiO  -  maxi  1,  *  H,  +  H,  \,  4  1  ♦  /n 

/(//)  -  maxi  \,  +  4  /*,,  I  4  +  /n 

*fc>  see  i(  subtract  1  +  H  +  I,  ♦  from  both  sides  ol  (4> 
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PROOF:  Let  P  =  1.2 . n.  Assume  that  RU.j)  does  not  hold  for  some  /  < ./.  Then 

minM,.#,)  >  minM,,fl,).  Consequently,  Johnson's  Algorithm  will  place  element  j  in  front  of 
element  /  contrary  to  our  assumption,  Q.E.D. 

Introduce  the  following  notations: 

E,  =  A,  -  B,.  I  =  (rlr  €  /.  E,  <  0|. 


I*  =  (rlr  €/.£,>  0],  /"=  I rlr  €/,£,=  0|, 

Let  P  =  />,,  //, . A,  be  an  arbitrary  //-sequence.  Then  the  following  obvious  proper¬ 

ties  hold: 

PROPERTY  1:  The  elements  of  /  are  arranged  in  a  nondecreasing  order  of  the  A,  and 
precede  the  elements  of  /*  that  are  arranged  in  a  nonincreasing  order  of  the  Br 

PROPERTY  2:  The  elements  of  /"  can  be  placed  in  any  order  as  long  as  they  do  not  pre¬ 
cede  (follow)  an  item  with  a  smaller  Ar(B, ), 

PROPERTY  3:  Any  subsequence  of  P  is  an  /{-sequence.*  Consider  a  sequence  <r,/r  c  /, 
and  an  /{-sequence  rr,  where  n  C  /  -  <r . 

PROPERTY  4:  T(crir)  <  T(<ttt).  T(rta)  <  T(tt'<t)  for  all  possible  permutations  7r  of 
the  elements  of  ir . 

PROOF:  According  to  ( 1 ) 

TUm)  =  maxI^T,  A,  +  T(tt).TUt)  +  £ 


£(< ttt  )  -  max 


£  A,  +  T(rr  ) ,  T(a)  +  £ 

,r  TT 


Hence,  7" ( tt )  ^  T(ir  )  ^  TUtit)  ^  TUrn  ),  Q.E.D. 

One  can  similarly  prove  £(7r<r)  <  T(tt(t).  According  to  Property  4,  to  find  a  sequence 
that  minimizes  (I),  provided  o-  is  fixed,  arrange  the  items  that  follow  (precede)  <r  in  an  R- 
sequence. 

This  rule  may  not  be  valid  if  >r  occupies  a  middle  position.  Consider  the  following  exam¬ 
ple  (Figure  I ): 


1 
2 

3 

4 

I  Kil  KI  I 


A,  Br 


2 

3 

6 

8 

9 

10 

3 

1 

’  I  his  docs  not  mean  ihal  a  subsequence  of  run  optimal  sequence  is  optimal  Nee  Remark  I.  Section  N 
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Assume  that  we  are  to  find  a  sequence  that  minimizes  (1)  where  a  —  3  occupies  the  second 
place.  Although  124  is  the  only  /?-sequence  of  /  —  a  —  (1,2,4),  1324  is  not  the  best  sequence 
since  r (23 14)  =  29  <  T(  1324)  =  30. 

3.  NECESSARY  AND  SUFFICIENT  OPTIMALITY  CONDITIONS 

Define  Wiir)  =  Tin)  -  ^  Br,  for  n  C  I.  Then 


(5) 


W(P)  =  T(P)  -  T  Br  =  max 

I  <  «  <  n 


1/  I 

+  L  f'/. 

r-  I 


Consequently,  the  minimization  of  (1)  is  equivalent  to  that  of  (5).  As  known  WiP)  is  the  idle 
time  of  machine  B  while  processing  sequence  P.  Let  a  and  7r  be  two  sequences,  n  C  /  —  a . 

PROPERTY  5:  Wian)>  W(a) 

PROOF:  According  to  (5) 

WOrr).£  E,  +  Win  )|,Q.E.D. 


W (an )  =  max 


Observe  that  Wia ir)  ^  Win)  may  not  hold.  Consider  two  sequences  P  =  ayin  and 
Q  =  a  iyir ,  define  the  following  conditions. 

(6)  A,  s?  W(P)  -  £  £,, 

(T 

(7)  A,  -  B,  SC  WiP)  -  Wiy). 


For  P  =  cry/w  and  Q  =  trytr  formula  (7)  becomes 
(7a)  A,  -  B,  ^  W(P)  -  ££,  ~  A,. 


We  will  show 

PROPERTY  6: 

£,  ^  0=»  ((6)  [WiQ)  ^  IC(/>)]!, 

£,  >  0  =£►  (1(6)  and  (7)]  )C(P)1|. 


PROOF:  Due  to  (5) 

(CIP)  =  maxi  l V(a),W(y)  +  ££,,4,  +  ££r,  IFIrr)  +  ££,], 

I  •!  <r  y  „y,  J 

WiQ)  =  maxi  Wia). A,  +  L£|.  )C(y)  +  It  +  E,,W(tr)  +  ££,|. 

[  .r  .r  niy  1 

If  £,  ^  0  then  WiQ)  <  ( £* )  whenever  A,  +  ££,  ^  fHP).  On  the  other  hand  if 

4,  +  ££r  >  IF(£)  then  WiQ)  ^  A,  +  ££r  >  WiP).  Q.E.D. 
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One  can  similarly  prove  case  /:,  >  0. 

Assume  that  P  is  an  optimal  sequence,  which  means  that  P  minimizes  (I)  and  (S). 

COROLLARY  2:  Q  is  optimal  if  and  only  it  one  of  the  following  conditions  hold: 

1  (61  if  E  <  0,  or 

2  (6),  (7)  if  >  0. 


Consider  the  following  example  (figure  2) 


1 

2 

3 

4 


P  =  1234  optimal  and  H  IP )  =  5  Let  <r  =  <6.  /  =  2,  3.  Permutation  2134  is  optimal  due  to 
F.:  <  0.  and  (6)  (3  <  5  -  0).  while  sequence  3124  is  not  optimal  (6  <  5—0). 

Assume  <r  =  (1).  y  =  (2).  i  -  3  Both  conditions  (6)  and  (7)  are  met  (6  <  5  -  ( —  1  > , 
6  -  4  $  5  -  (-11-3)  Consequently,  1324  is  optimal  Observe  that  neither  2134  nor  1324  is 
an  K-sequence 

RLMARK  1:  Although  1  324  is  optimal  its  subsequence  132  is  not.  since  7" ( 1 3 2 )  = 
16  >  7~(  1  23 )  =  14 

Usually  the  number  of  optimal  solutions  far  exceeds  the  number  of  /^-sequences.  Con¬ 
sider  the  following  case: 

E,  >  0.V/,  max  B  <  min  A,.  B ,  ^  B,,Vi  &  j. 


A,  Br 


1 

2 

3 

5 

6 

4 

6 

3 

I  k, i  to  2 


While  there  is  only  one  /^-sequence  the  number  of  all  optimal  solutions  (where  the  last 
element  is  an  item  with  the  smallest  B, )  is  (n  —  1)1. 

4.  CRITICAL  ITEMS 

Element  u  is  a  critical  item  of  an  optimal  sequence  P  =  (run  if 
WIP)  =  Au  +  ££,  (or  TIP)  =  S,  Ar  +  £  Br |. 

iT  \  <ru  tin  i 

Assume  in  this  section  E,  ^  0,  r  6  /  Let  P  =»  <r  yr  oii/tt  ,7r  2  be  an  /^-sequence,  and  u  its  criti¬ 
cal  item  Suppose  we  move  u  upward  in  front  of  <r ,/,  or  downward  behind  jrr ,.  Will  the  result¬ 
ing  sequence  be  optimal?  The  following  theorems  resolve  this  issue. 

THEOREM  2:  Q  =  <r  , u<r  ,i/n  ,tt  ,  is  optimal  if  and  only  if 
E,  >  0.  B,  =  B„.  r  €  <r  au. 


(8) 


I  OX 


U  S/U  tK< 


PROOF 

-=>  I  If  A„  <  0,  and  A,  <  0,  r  €  <r  ,<r ,/  Hence, 

M'K>)  3?  W'(<r,(0  3s  ,l„  +  £/;,  >  A„  +  X  A,  =  HO/*). 

,r  ,  .r  j.  r  ,i 

contrary  to  the  assumption,  0  I  I) 

2,  If  A  <  0  one  can  show  as  before  that  W{Q)  >  MO/’),  Q.F.l) 

Since  A,  >  0  and  A,  >  0  then  B  S?  B  (Property  I) 

3.  If  B  >  B„  then 

M  <  t> )  3s  W'  (cr  ,  hit  ■>/ )  3*  A,  +  X  E'  =  4  +  Z  £  +  -4«  -  > 

•  t  j  m»f  .  -r  ,.f  , 

>  .4,  +  X,  £  +  •-»,  -  B,  =  MO/’), Q.F.l). 

A  >  0,  A„  >  0  and  A  =  R„  imply  (X)  due  to  Property  1. 

<=  Condition  (8)  along  with  Property  I  imply  that  Q  is  an  R-sequence.  One  can  similarly 
prove  the  following. 

TIIF.ORFM  3:  Q  =  <r  pr  A  in \iin  >  is  optimal  if  and  only  if 
(9)  A,  <  0.  .4,  =  .4,,.  r  €  ujn  , . 

Consider  sequences  0  and  £>  of  Theorems  2  and  3. 

PROPKRTY  7:  1 .  If  (3  is  optimal  then  /  is  its  critical  item, 

2.  If  Q  is  optimal  then  ./  is  its  critical  item. 

PROOF:  The  optimality  of  Q  implies  (8).  Hence, 

W{Q)  3s  MO<r  ,«</  ,/)>  A,  +  X  A,  =  -4,,  +  Xt=  »'</’)-  FK ( C? ) . Q.E  D 

<r  j//if  ,  <r  j.r  ,< 

I  he  proof  of  the  second  part  is  symmetrical. 

Due  to  (81  and  (9) 

IA (  Q  )  >  ft  Ur  | u  ) ,  ft' ((3)  >  MOir  ,<r  w/ jr  , «  ) . 


Hence,  u  is  no  longer  a  critical  item  of  Q  or  Q  . 

Suppose  that  we  move  element  i  of  an  R-sequence  P  —  <f  i<r  ;utt  ptr ,  ahead  of  its  critical 
item  u  The  following  theorem  resolves  the  optimality  issue  of  the  resulting  sequence. 

rilF.ORFM  4  Q  =  ir  ti<r  ,utt  ,7t  ,  is  optimal  if  and  only  if 
MO)  A„  <  0.  A,  <  0,  <r,  =  4>,  4,  =  4„. 


PROOF 
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=$>  If  E.  >  0,  then  (Property  5)  W(Q)  ^  ff'Ur  tur  ,u  I  >  A„  +  ^  V.,  >  ,4„  +  £  /.,  = 

'f  I  Ml  ,  <T  ,<f  , 

HI/’),  which  is  in  contradiction  with  the  optimality  of  Q.  Hence.  /.,  <  0  This  implies 
/.',  <  0.  r  €  <T,ir?uv,  and  /l,  ^  (Property  I). 

(T(C>)  3*  W’Ur,/)  ^  .4,  +  £/„  >  A„  +  ££,  =  (Fl/D. 

,*r  , 

Thus.  W{Q)  >  H  (/’)  if  <r  ,  ^  <b  or  A ,  >  4„,  Q.F.D. 

Due  to  Property  1  (/’is  an  /(-sequence)  condition  (10)  implies  E,  <  0.  A,  --  ,4„, 
r  €  u  tt  | #.  Hence.  (/ is  an  /(-sequence,  Q.F.D 

Optimal  Presequences: 

Given  an  /(-sequence  P  =  an  €  II,  then  n  is  also  an  /(-sequence  (Property  3)  Consider 
a  permutation  <J  -  an  6  11. 

We  say  that  <r  is  an  optimal  presequence  when  Q  =  <ttt  is  optimal.  Q  is  uniquely  deter¬ 
mined  for  each  <r ,  once  P  is  given.  Hence,  to  find  all  optimal  sequences  it  is  sufficient  to  gen¬ 
erate  all  optimal  v-element  presequences  for  each  v  55  n  — 1,  given  an  /(-sequence  P 

REM  ARK  2:  According  to  Properly  5  presequence  it  i  may  be  optimal  only  if  it  is 
optimal. 

REMARK  3:  Formulas  (6)  and  (7)  allow  to  determine  the  optimality  of  presequence  it  / 
provided 

1.  it  is  already  known  to  be  an  optimal  presequence. 

2  /’ is  a  known  /(-sequence.  _ 


Lei  /’  =  a u(i  be  an  /(-sequence  and  u  its  critical  item.  Consider  another  sequence  Q. 

THEOREM  5:  If  Q  is  optimal  then 

1 .  The  elements  of  au  precede  those  of  p  whenever  E„  >  0,  or 

2.  The  elements  of  up  follow  those  of  «  whenever  E„  <  0. 


PROOF: 

CASE  £„  >  0:  Let  Q  =  ittt  where  it  is  an  optimal  presequence.  Assume  that  au  is  a  k- 
element  sequence  (k  ^  n  -  I).  For  each  s  <  k  consider  sets  of  s-element  optimal  prese¬ 
quences  it  According  to  Theorem  4  no  element  of  P  belongs  to  an  optimal  <r  if  s  =  1.  Due  to 
the  same  Theorem  and  Remark  2  this  is  also  true  for  v  =  2,  3 . k ,  Q.F.D. 

The  proof  of  second  case  is  symmetrical. 

5.  GENERATING  OPTIMAL  SEQUENCES 

Consider  an  /(-sequence  P  =  iTiujn  where  the  critical  item  u  is  the  .s-th  element  of  P 
Theorems  2,  3  and  Property  7  imply: 

COROLLARY  3:  If  none  of  the  conditions  (8)  and  (  )  holds  then  u  remains  the  v-th  ele¬ 
ment  of  every  optimal  sequence 


no 


W  S/WAK( ' 


Elemen!  u  is  also  a  critical  item  of  every  optimal  sequence.  To  see  it  assume  that 
Q  =  <»  u{i  where  <*  and  fi  are  permutations  of  elements  of  rr  i  and  in  ,  respectively.  Then, 

H  (Q)  ^  WUtu)  >  4„  +  £/;,  -  4„  +  £/;,  =  ( /^ )  =  )T<(?).Q.E.D 

5. 1  Let  P  =  a„M  |«  |M,  . . .  <»v  be  an  ^-sequence  where  none  of  (8)  and  (9)  hold  for  criti¬ 

cal  items  ur.  I  ^  ^  r/.  We  will  show  that  the  problem  of  generating  optimal  presequences 
<r  C  /  can  be  decomposed  into  q  +  1  separate  subproblems  Consider  an  optimal  presequence 
it  —  <»„«,«  1 1/,  ...  o,  {u,(r,,  i  SJ  r/  where  «\  is  a  permutation  of  the  elements  of  a,  while 
(t  ,  C  <»,,  for  each  0  ^  s  I.  Then, 

(ID  IT</>)  =  4„  A  ££,  -  ££,.  1  <  t  <  q. 

it  //,*r  , 

Formulas  (6)  and  (7)  remain  in  their  original  form  for  t  =  0  while  for  t  ^  1  they  become 
(b  )  4,  <  -  ££,. 

IT 

17  )  4,  -  fi,  <  B„  -  ££,  -  W(y). 

IT  f 

where  <r,  C  «,.y,  C  To  illustrate  the  decomposition  technique  along  with  the  generating 
procedure  consider  the  example  of  reference  [3]  (Figure  3). 
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P=  12345  is  an  /?-sequence,  W(P)  =  4  and  u i  =  3.  t/s  =  4.  Uy  =  5.  Consequently, 

<*0  ”  (1.2),  «|  -  a2  =  =  <t>.  Since  the  assumptions  of  Corollary  4  are  met  for  all  u,  (they 

automatically  hold  for  u2  and  Uy  since  «j  =  <»i=d>)  every  optimal  solution  Q  =  ...345.  It 
only  remains  to  find  optimal  one  element  presequences  of  a()  since  an  is  a  two  element  set. 
Due  to  E,  <  0.  r  €  an  it  is  sufficient  to  check  (6).  Presequence  2  is  optimal  since  (6)  holds 

for  /  =  2,  (t  =  (see  Remark  3)  in  addition  to  the  known  optimal  presequence  1.  Conse¬ 

quently,  12  and  21  are  optimal  arrangements  of  an.  There  are  only  two  optimal  sequences 
12345  and  21345.* 

5.2.  Consider  some  critical  item  u  of  an  R- sequence  P  where 

1.  (8)  holds  for  some  <t =  /' | / 2  ...  /, /',  or 

2.  (9)  holds  for  some  jn  ,  =  jj t  . . .  /,. 

According  to  Theorems  2  and  3  we  can  generate  ^-sequences,  say,  Pk  by  arranging  the  ele¬ 
ments  off r  2iu  or  ujn  2  of  Pin  the  following  manner: 

(12)  /|/>  . . .  iju.  ui\i2  . . .  ij .  /,/ ,  . . .  ijui , 


‘the  authors  of  131  using  a  lexicographic  search  procedure  examined  tin  this  example)  lower  hounds  for  9  prese- 
quences  with  the  number  of  elements  ranging  from  3  to  5  ( in  7  prcsequcncesh 
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(13)  Ujj  I  JJ I  ■■•AM .  A  »H  I  /  .'  •  •  •  /,  I 

In  view  of  Property  7  the  critical  items  are  the  last  elements  of  the  sequences  of  (12)  and 
the  first  elements  of  the  sequences  of  (13)*. 

To  find  the  optimal  permutations  we  apply  the  procedure  of  Section  5.1  to  each  l\  assum¬ 
ing  that  none  of  its  critical  items  can  be  moved. 

To  illustrate  this  case  consider  the  following  example  (Figure  4) 
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P  =  T 234567  is  an  37-sequence,  W(P)  =  3.  U\  =  1,  u2  =  5. 

The  dashes  indicate  the  critical  items  of  P. 

Since  (8)  and  (9)  hold  for  <r  ,/  =  (4)  and  jirt  =  (2)  four  /?- sequences  are  generated  (see 
(12)  and  (13)). 

P,  =  1234567.  P,  =  2134567,  />,  =  1235467.  P4  =  2135467. 

To  generate  optimal  sequences  out  of  P\  observe  that  P \  =  anU\a\U2a2  where  «n  =  <b 
a,  =  (2,3.4)  (6.7).  According  to  (6)  and  (7)  the  list  of  optimal  arrangements  of  a  \  and 

a is  234_,  243,  423  and  67,  76  respectively.  Cojisequently.  P \  generates  six  sequences 
1234567.  1243567.  1234567,  1423576,  1 243576.  1423576. 

To  find  the  remaining  optimal  sequences  we  have  to  verify  (6)  and  (7)  for 
a  j  =  (1.  3.4),  (2,  3.  5)  and  (1.3.  5)  since  «;  =  (6.  7)  is  the  same  for  all  four  sequences  Pk.  The 
total  number  of  optimal  solutions  is  24  while  the  number  of  37-sequences  is  4.  . , 


5.3.  Consider  the  case  when  /;,  =  0  for  r  €  /"  ^  <b.  Let  P  be  an  37-sequence.  We  can  assume 
(Property  1)  that  P  =  o/3y  where 

/;,  <  0.  r  €  «,  F,  =  0.  r  €  fi.  F.,  >  0,  r  €  y. 

Let  max  A,  =  .4,.  Consider  sequence  <»y. 

.  •  u 

THEORKM  6:  WiP)  =  max!  W(ay).A v  +  ££', ) 

<» 

PROOF:  Let  u  be  a  critical  item  of  P  Examine  three  cases: 

1  u  €  y.  Then  P  =  aliytuy}  and 

HAP)  =  A„  +  £  Er  -  A„  +  ££,  =  (C(«y). 

„l>r, 

2  u  £  a.  The  proof  is  similar  to  that  of  the  previous  case. 

3.  u  £  fi .  P  =  o/3  |  u/3  yy ,  and 

«  (n  -  /<„  +  I/:,  =  a„  +  ££,. 


W  S/.WARC 


1 1 2 

Expression  .4,,  +  ££,  is  maximal  for  u  =  v,  Q.E.D. 

Theorems  2,  3,  and  4  remain  valid  even  for  E„  =  0  as  long  as  E,  ^  0,  for  r  £  /  -  u. 

We  offer  ihe  following  procedure  of  generating  optimal  sequences: 

STEP  1  Delete  set  /"  from  /  and  find  an  /?-sequence  ay . 

STEP  2  Apply  the  generating  procedure  of  Section  5.2  to  sequence  w  where 

oy  if  .4 x  +  y /:,  <  H-'foy), 
ovy  i  f  A  ,  +  £  E,  ^  W  (o  y  ) . 

STEP  3:  Eor  each  sequence  it  generate  n-element  optimal  sequences  by  placing  the 
remaining  items  of  fj  in  the  appropriate  places  using  formula  (6). 

To  illustrate  the  procedure  expand  the  example  of  Figure  2  by  adding  two  new  elements  5 
and  h  where  I ,  =  //,  =  8.  .4,,  =  //,,  =  5. 

SUP  1  We  already  know  that  oy  =  1234  is  an  /^-sequence. 

STEP  2  tt  —  12534  since  .4,  +  =  W  (ay)  =  5. 

Observe  that  a>  =  (3).  and  elements  5.  3,  4  cannot  be  moved  Handling  seta«=  (1,2) 
we  obtain  two  optimal  sequences  12534  and  21534. 

STEP  3  Condition  <61  for  i  =  6.  MAP)  =  M Toy)  =  5  becomes 
(14)  5  sj  5 

Consider  Figures  5  and  h  where  the  £/.,  are  written  on  the  margins  of  the  tables  (except 
=  0  for  o-=<t>) 
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According  to  (14)  presequence  <r6  is  optimal  if  and  only  if  <  0.  Hence,  element  6 

»r 

can  be  placed  everywhere  as  long  as  it  precedes  element  4.  Consequently,  there  are  ten  optimal 
sequences. 
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6.  MAXIMAL  SOI TTIONS 

I  /’  be  an  ^-sequence  produced  by  Johnson's  Algorithm.  It  is  easy  to  see  that  a 
reversed  sequence  /’  -•  /».  .  p  .  .  p  ntaximi/cs  111  Without  loss  of  generality  we  can 

assume  p  -  1.2 . n.  and 

/.,  >  0.  r  I.  /.,  %  0.  r  H  t  t  I, 

1  ,V  H  %  H.  <  . .  .  $  H  .  I  . .  ^  ,f ...  £  ...  ^  l„ 

lot  some  0  $  /  n  * 

I  III  OKI  A1  7  |  lenient  i  or  /  -*  1  is  a  critical  item  of  lJ 

I’ROOI  Define 

C  -  I  '  I* 

■  i  •  i 

then. 

Ml/')  =  max  A'„ 


CASK  1  I  ^  ^  i.  Then  H,  ^  II, .  {  and  H,.\<  -I, .  t  imply  H,  <  l,.|.  Consequently, 

K  <  k  .  | . 

CASH  2:  /+!</'<«.  Then.  I,  <  I,  i  and  I,  ,  ^  /f,  |  imply  f,  <  H,  Hence, 
k  ^  k  |.  Combining  both  cases  we  have 

k  <  k<<  k  and  K  .  i  >  k  2  ...  ^  A'.,. 

Thus.  W  (/•’  >=  max  A'.,.  Q.H.l). 

Let  ii  be  the  critical  item  of  a  reversed  Johnson  sequence  lJ '  —  1 . 2  . . .  .  n.  It  is  easy  to  see  that 

any  sequence  Q  ~  nun  maximizes  ( 1 1  as  long  as  a  is  a  permutation  of  1,2 . //  —  1  while  fi 

is  a  permutation  of  /m  I . n. 

COROLLARY  4  The  minimum  number  of  maximal  sequences  is  It/  - 1  II,  where 

u  =  t  or  t  +  I 

Reversing  a  minimal  (non  K)  sequence  does  not  necessarily  produce  a  maximizing 
sequence  Consider  the  example  on  f  igure  2.  Mthough  1424  is  minimal  the  reversed  permu¬ 
tation  423  I  does  not  maximize  1 1  t  since  I  <423  1 1  =  2 1  <  7  (4321 )  =  23  Let  I3'  =  1.2 . n 

he  a  reversed  Johnson's  sequence.  Consider  a  set  of  n  —  1  —  element  permutations  it  of 
numbers  r  e  /  l/l  It  is  obvious  that  rr  =  1.2 . /-I./  +  I . n  maximizes  H'(tt) 

Due  to  I  hcorem  7 


W  <  H)  - 

> 

II 

+ 

where 

(16) 

L  •  i 

^  li, ,  i,  ^  H, ,  |.  if  a  =  / 

•Ci  ^ 

II,.  I, .  s  ^  li, .  |.  if  m  =  /  +  1 

'  1  <  >1  !  II  IK)  / 

*'>  while  l«n  1 

ii  no  / . '  o 

i 
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w  hile 

II  (77-  )  -  A  ,  A ,  / 

w  here 

v  "■  11 

«  1  or  //  *  I .  if  /  //. 


I  111  ORI  M  X  11(77  I  t  II  (/’  1 
PROOI  I  here  are  mn  eases 

t  x-  a  I  |f  /  v  a  ihen  1  t.  and  /  >  0  I  lence,  II  ( /’  )  H  (77  )  -  /;,  >  It 
2  If  /  >  //  ihen  II  < /*' »  -  H  <jt  ).  0  12- 1 ) 
n- u:  Pour  suh  eases  are  (o  he  considered 

I  tt  =  1.  t  1 .  2  11  -  1.  '  H  .  4  //  —  /  t  1 .  \  4  1/  =  M  I .  v—  ■'  +  2 

li  (/’  t  II  (77  1  -  li  I  lease  I).  I  I  (case  2).  li  -  I..,  lease  .41. 

I  .  I  (ease  4i 

Ihe  nonnegativ  itv  ol  U  ( /’  )  II  < rr  1  follows  direed>  from 
I  1 16)  lor  eases  I  and  3 

2.  1 15)  and  1 18)  for  eases  2  and  4(1,.,  <  //,.  li,  <  -t,  -  case  2). 

Lei  <r  he  a  permutation  of  numbers  r  1  ./  where  J  is  a  proper  suhsei  of  /  Theorem  8 
implies 

PROP1RIA  8:  max  It  (<r  I  ^  max  HI/’) 

/■ 

Propertv  8  does  noi  iniplv  H'(<r  1  HI/’),  To  see  ii  consider  the  following  example 
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let/’  -  I  23  and  o  -  13  Still  (((/’)=  4<  H  (rr  I  =  II) 
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A  THEORETICAL  AND  C  ()\1Pl  TATION  A  I.  COMPARISON 
OF  "EQUIVALENT"  MIXED-INTEOER  FORMULATIONS* 
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1.  FOR  Ml  I  ATIONS:  FQl  I  VAI.F.  NT  AM)  OPTIMAL 

This  paper  provides  a  theoretical  and  eompuianonal  comparison  of  aliernaiive  mixed 
integer  programming  formulations  lor  optimization  problems  involving  certain  types  ol 
economy-of-scale  functions  Such  functions  arise  in  a  broad  range  of  applications  Itom  such 
diverse  areas  as  vendor  selection  and  communications  network  design.  \  "nonstandard”  prob¬ 
lem  formulation  is  shown  to  he  superior  in  several  respects  to  the  traditional  formulation  ol 
problems  in  this  class 

This  first  section  describes  a  rigorous  approach  to  formulating  certain  optimization  prob 
lems  through  the  use  of  "minimization  models"  |4.5, hi  The  minimization  model  concept  is 
then  used  as  the  basis  for  defining  a  family  of  "equivalent"  formulations  .is  well  as  a  means  of 
defining  an  "optimal"  formulation.  Sections  2  and  a  establish  the  optimality  of  a  very  compact 
formulation  for  functions  belonging  to  a  class  of  economy-ol-scale  functions  Computational 
results  lor  a  communications  network  problem  arc  then  given  to  illustrate  the  superiority  of  this 
formulation  as  compared  to  a  "standard"  formulation  of  the  problem 

I  he  economy -ot -scale  property  that  we  will  consider  is  encountered  in  a  broad  vanctv  ol 
cost  functions  foi  goods  ranging  from  doughnuts  to  telecommunications  links  Roughlv  speak¬ 
ing.  a  function  is  said  to  have  an  economy-of-scale  properly  if  the  cost  (per  until  of  a  commod¬ 
ity  decreases  if  certain  "large"  quantities  ol  the  commodity  are  purchased  A  simple  example  ol 
such  a  cost  function,  hut  one  which  serves  to  illustrate  some  of  the  properties  that  we  wish  to 
consider,  is  a  cheaper-hy -the  dozen"  function  defined  as  follows  let  i,  denote  the  number  ol 
vmg/e  units  ol  a  commodity  with  the  cost  per  s/iic/c  unit  being  a  /Ki\/rnc  constant  i  .  let  t 
denote  the  < nonnegative,  integer  I  number  of  i/nreirv  (groups  of  I2>  purchased,  the  price  per 
dozen  being  a  positive  constant  c  •  I  2<  .  (so  that  it  is  i/ic.i/vr  to  purchase  a  dozen  than  it  is  to 
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purchase  12  single  umtsi  ami  lei  k  l\  l  (see  figure  I  lot  a  typical  k  (\  II  denote  (he  "cheaper- 
hv -thc-do/en”  lunelmn  lhai  represents  ihe  minimum  cost  ol  purchasing  at  least  \  units,  i nr 
simplicil\  in  this  example.  \  and  i  ,  will  he  assumed  to  he  mniiinmus  variables  ll  is  easily  seen 
that  k  ( v  I  can  be  compactly  icpresented  as 

(111  I  U  i  -  mm  ,  i  i  i  i 

s i  i  '12  i  •  v 

i.i  ■  ().  i  integer 

I  hal  is.  substituting  am  conslanl  v  lor  v  in  the  right  hand  side  id  ll  M  yields  an  optimization 
problem  I  in  the  variables  t  and  i  whose  optimal  value  is  precisely  k  iv).  Of  course,  Ihe 
piecew ise-hnear  function  k  I  v  I  can  he  lepresenied  in  many  other  ways.  but.  as  will  he  seen.  Ihe 
representation  (I  I  »  is  not  only  compact  hut  also  is  useful  in  formulating  optimization  problems 
mvolv  ing  k  f \  >. 


I  he  KIIS  nl  i  I  I  I  is  an  example  of  a  /mu  </-//»<  er<  mmtmt:aumi  model  (MIMM).  a  concept 
l  hal  was  described  in  |4..\ft|  I<>  del  me  I  Ins  eoncepl.  suppose  !  is  a  function  from  IK  into 

1  ■  •  and  th.il  the  following  equation  holds  for  all  v  belonging  to  a  subset  .V  of  IK 


(  ( (Ml’  \KISI  )N  ( >1  Ml \ I  I )  l\  1 1  ( ,1  l<  |  ( i|< Ml  |  \  [  |i  i\y 


( 1 .2)  / 1  \  )  ~  mm  n 

sublet  1  lo  h  =  />  fx. 

i  >■  t)  ,niil  t  integer  tin  i  <  I. 

vxhere  /  is  a  subset  of  il . n\.  h  is  an  element  of  IR".  anil  c.  T.  and  1  are  of  dimensions 

1  x  n.  m  x  n  and  m  x  I  respectively  (We  will  assume  dial  the  optimization  problem  on  die 
rbs  on  112)  has  an  optimal  solution  if  its  feasible  sei  is  nonempty,  and  that  the  "optimal  value" 
is  defined  to  be  +°°  if  the  feasible  set  is  empty  )  I  he  expression  on  the  rhs  of  (1.2)  is  said  to 
be  a  MIMM  tor  I' on  V  (for  our  purposes,  it  is  convenient  to  assume  dial  Vis  annex,  although 
for  the  general  theoretical  development  of  MIMM's  given  in  |4).  iliis  is  not  necessary).  As 
noted  in  1 4 1 .  the  utility  of  MIMM's  arises  in  part  from  the  fact  that  for  any  set  S  C  .S'  the  fol¬ 
lowing  two  problems  are  equivalent: 

<14)  min  /  (a  )  +  /  (  x.r  ) 

si  \  f.  .S'.  ( x.r )  £  /. 

and 

i  I  4)  mm  it  ■+■  /  I  v.r  ) 

si  x  •  V  (x.r)  €  /'. 

■fi  =  b  -  4.v. 

i  ^  0  and  y,  integer  for  i  6  I 

The  problems  <  13)  and  (I  41  are  equivalent  in  the  sense  that  <1.3)  has  a  feasible  solution  if  and 
only  if  <1  4)  has  a  feasible  solution,  and  (v*.r*)  is  an  optimal  solution  of  (1.3)  if  and  only  if 
there  exists  a  y*  such  that  (v*.y*r*)  is  an  optimal  solution  of  (1  4).  From  a  computational 
point  of  view  the  transformation  from  (1.3)  to  (1.4)  may  allow  the  replacement  of  a  piecewne- 
Itnear  objective  function  term  /  (.v>  by  a  linear  objective  function  term  a  .  Thus,  if  an  optimiza¬ 
tion  problem  has  only  linear  constraints  and  objective  function  terms  for  which  MIMM's  exist, 
then  this  conversion  procedure  may  be  carried  out  term-b\  -term  until  the  original  problem  has 
been  transformed  into  a  mixed  integer  program  ( M I P ) .  Note,  however,  that  although  this  MIP 
will  be  equivalent  to  the  original  problem,  equivalence  may  be  destroyed  if  the  integrality  con¬ 
straints  on  the  newly  added  variables  are  deleted,  a  relaxation  which  is  usually  the  first  step  of 
an  algorithm  for  the  solution  of  an  MIP  In  particular,  the  relaxation  of  the  integrality  con¬ 
straints  of  a  MIMM  will  yield  a  parametrically  defined  family  of  problems  (a  linear  programming 
minimization  model  <LPMM))  whose  optimal  value  must  be  <see  [4] )  a  convex  function  on  all  of 
IR1  Thus,  this  relaxation  will  mean  that  a  nonconvex  objective  function  term  of  the  original 
formulation  is  replaced  by  a  convex  approximation  In  algebraic  terms,  defining 

(1.5)  /*( \ )  =  min  it 

s.t .  A  i  =  b  -  A  x.  v  ^  0. 

it  follows  that  I*  is  convex  on  IR1.  so  that  if  /  <as  defined  in  <1.2)1  is  nonconvex  on  V  (in  the 
sense  that  there  exist  points  x,.  ,v.,  \  (  .Sand  a  A  £  (0,1)  such  that  \  =  Axy  +  <  1  -  A).v:  and 
/  <  v )  >  A  /  ( .v  | )  +  (I  a)  VO.  then  /  and  ./*  cannot  coincide  over  all  of  S  (in  particular,  they 
would  not  agree  at  v)  The  difference  /  <x)  /Tv)  (which  is  always  nonnegative  because  of 
the  relaxation  of  the  constraints)  will  be  termed  the  relaxation  error  of  the  I.PMM  at  x 

In  the  case  of  the  MIMM  (I  II.  for  example,  the  optimal  value  function  (for  x  ^  0)  for 
the  I.PMM  obtained  by  relaxing  the  integrality  constraints  of  (1.1)  is  easily  seen  lo  be  the  linear 
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I  mu  ii>>n  A'tvl  i  v  I?  I  lie  relaxation  emu  in  ilns  particular  ease  is  lluis  ihe  difference 
between  the  values  A  ( \  i  and  A  *  i  \  i  (see  I  igure  I  t  Note  dial  this  dilterenee  is  unless  v 

is  an  integer  multiple  id  12  U  ni  \  ■  0.  At\i  A‘(\l  -  (f.  hut  we  are  enneemetl  here  nnl\ 

with  nonnegative  values  of  v  l 

In  enmparmg  iillrnumx <  MIMM  formulations.  a  eomparison  ul  the  behavin'  nl  the  relaxa¬ 
tion  errors  establishes  the  relative  aevuraev  id  the  approximations  used  in  the  hrst  step  id  the 
solution  id  the  respeetive  Mil’s  Ihus.  d  /"is  the  optimal  value  function  ol  the  eonnnuous 
relaxation  ol  a  i  lilltnnl  MIMM  lor  /.  and  /'tv>  x  /  * '  <  v  )  lor  all  v  •  S  Ivvhieh  we  write  as 

/'  '  /").  then  the  MIMM  (1  2*  may  he  eonsiderevl  to  he  at  least  as  good  (with  respeet  to  the 

relaxation  etror  eritenont  as  the  MIMM  from  which  /"  was  derived  Moveover.  tl  it  can  he 

established  that  the  meviualitv  /'  •  /".  holds  lor  all  convex  lunctmns  /“  satisfying  /  /*'. 
then  the  MIMM  giving  rise  to  /'  will  he  optima!  Irom  the  standpoint  ol  error  n  a  relaxation 
solution  strategv.  anil  will  therelore  he  said  to  he  n  la\tiiiim-opiimal  on  S.  I  As  will  be  seen,  a 
lunetion  mav  have  more  than  one  relaxation-optimal  MIMM.  so  additional  MIMM  criteria  also 
will  he  considered  >  In  order  to  more  easily  dexenbe  results  of  this  type.  it  is  convenient  to 
introduce  some  additional  terminology  II  li  is  a  lunetion  mapping  a  convex  set  /  into 
I  «  .  *  >  I.  the  ,ii/iicv  mu  lope  ol'  li  on  /  < which  may  be  thought  of  geometrically  as  the  lai.iusi 
convex  function  hrhm  h  on  /  >.  denoted  by  ■  *  < //. x.  /  ) .  is  the  function  satisfying  the  relations' 

tint  i  '  l  v.  /  t  ‘i  1  \  I  lor  all  v  •  /. 

1  I  "  1  i  * (/;.  v.  /  )  is  convex  on  /. 

<18t  il  g  I  v  >  <■  /;  (  v  >  lor  all  v  *  /  and  e  is  convex  on  /. 

then  gt.vi  c‘  i h. /  I  lor  all  v  (  / 

•  In  places  where  reference  to  the  variable  is  not  needed,  we  will  write  <*(/;./)  in  place  of 
i*(/i.v,/  >  i  I  xistence  and  uniqueness  of  t '(//./  )  easilv  follow  from  the  fact  that  the  point- 
wise  xupremum  of  a  family  of  convex  functions  is  convex.  Defining  on  7  the  set  of  functions 

('(/j,/  )  =  Ic'jg  is  convex  on  /.  ,g  <  h\. 

i  "(li.f  I  is  simply  the  supremum  of  (  (/;./').  It  might  be  noted  that  the  domain  7"  plays  a  very 
significant  role  in  determining  the  convex  envelope  That  is.  the  value  of  the  convex  envelope 
at  a  particular  point  may  he  different  lor  different  choices  of  /  I  bis  aspect  of  the  convex 
envelope  will  be  taken  up  in  Section  2 

[Tie  optimal  value  function  of  a  l.l’MM.  in  addition  to  being  convex,  is  also  piecewise- 
lmear  HM  >.  and  it  is  also  convenient  to  introduce  some  terminology  for  piecewise-lineur  funr 
lions  ol  a  single  variable,  which  are  our  principal  concern  m  this  paper 

We  will  sav  that  a  real-valued  lunetion  h  defined  on  a  closed  interval  |a,,.o.,  I  C  IK 1  is  a 
[>ic<  (Hi\i-lii!itii  ftitii  non  mi  |o,,.it.|  mill  breakpoints  <>,,  <  <«  ,  <  <<  if  h  is  a  (tint'  on  each 

subinterv  ,il  |o  .  o  .  :  |  ami  |/;  to  .  1 1  li  to  )  |,  hr  . ,  *»  it  I  ^  |/t  In  )  h  (n  ,  1 1  to  n  1 1  lor 
/  I.  .  />  I  (that  is.  the  slope  to  the  left  of  <»  differs  Irom  the  slope  to  the  right  ol  n  I 

(he  basic  result  that  will  be  used  to  establish  that  certain  formulations  yield  convex 
envelopes  is  the  sufficiency  part  of  the  following  theorem 

1  III  OKI  M  I  l  et  g  be  a  lower  semi-continuous  fisc)  lunetion  mapping  |n,,.n  I  into 

(  no,  *m|  with  .c  (ft  )  <  4  ‘JO  lor  1  =  1),  .  ft 
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l  et  e ’  he  a  convex  piecewise-linear  function  on  |o„.o;,l  »nh  breakpoints  <*„  «"  » ,  < 
and  let  ,«r* ( \  t  $  ,g  <  \  )  lor  \  t  lo  I  A  necessary  and  sufficient  condition  for  g'  to 
be  the  convex  envelope  of  g  on  lu„.o,,l  is  that  „  *  <o , )  g  to  l  lot/  (I.  ...  .// 

Proof  lo  establish  \u(fiacnn.  suppose  that  g  *  (  <g.  !o,,.o,,| )  I  hen  for  any  \  ‘ 
l<>„.n;,l  there  exists  at  least  one  pair  <»  ..  o . . ,  of  breakpoints  such  that  v  |rt,./i1.i|.  Choosing 
A  t  It).  I  ]  such  that  x  =-  Ao:  +  II  A)o,.t,  we  haxe  (using  the  convexity  of  el.  j  I  \  i  C 
Ag  t<« , )  +  tl  a)  ,e  to  .  | )  ^  a  .do,)  +  tl  a)  .e  to  ,|t  A.g'fo.)  *  tl  a) 

e* to . .  j  >  =  ,e* t  \  )  I  bus.  ,e ( x )  §  g'f.v)  for  any  \  <  |o,, .<*.,!  establishing  that  e*  <’ 

<g.  |o,).or|) 

lo  show  necessity,  suppose  that  g<n,,)  ,e‘i« ,,>  >  0  Since  e  is  l.s.c.  and  g'  is  con¬ 

tinuous.  there  exists  a  Sid  tO.o,,)  such  that  o„  <  x  <  o,,  *  A,,  implies  etxl  -i  eto,,)  tI(  2 
and  e * < .x )  g*(<»n)  +  t,^2-  eto,,)  <-,*2 

Now  consider  the  PI  function  g  (see  I  igure  2)  with  breakpoints  at  bo.,.  ((l|  + 

ftn.  0| . o.,  and  function  values  eto,,)  -  eto,,)  2.  eto,,  *-<■>,,)  e'to,,  *  A.,).  £>(»,  )  - 

e*lo;)  1/  =  1 . /> )  Note  that  eto,,)  >  e'to,,).  but  that  gfvl  -  g't.v)  for  x 

|o„  +  and  that  g  is  a  convex  function  on  |o I  finally,  the  relations  eto,,!  “ 

e -  «</2  and  gto-n  +  A,,)  =  g*(o,>  + Au)  .<  e'<<»„)  *  2  -»  eto,,)  to  2  imply  g(,\i  < 

e  -  «,/2  for  x  €[(»,,.  o,i  +  b|.  so  that  g(\)  <  e<x)  for  x  (  |o,,.o/.l  Thus, 

e I  v )  €  Ctg(.v),  |(»„.o(,|)  and  .eto,,)  >  e'to,,).  contradicting  the  hypothesis  that  e'<x) 
i  *(g..v.  |oi,.o,,lh  A  contradiction  may  be  similarly  obtained  if  g<of,  ■  >  e'to,.)  I  or  an  interior 
breakpoint  it,,  the  construction  of  a  suitable  g  is  similar  < see  figure  3).  except  that  the  break¬ 
points  of  ,e  (where  it  coincides  with  g')  are  taken  to  be  o . <<  |.  o  A  .  - 

A  ....  <ir.  where  0  <  A,  <  min  |o  o  ,.o  . ,  ,,  I  is  chosen  so  that,  defining 

t  =  e<o, )  -  e*to.)  >  0.  we  have,  for  x  (  |,>  A  .<»  -  A  |.  the  inequalities  g  <  \  I  ->  gto  ) 

t  ;2  g*(v)  <  g*to,»  +  «,/2  Because  of  the  change  in  slope  at  breakpoints,  it  may  be  verified 

that  e'to  )  <  gto,).  and  thus  a  contradiction  may  be  obtained 

Note  that  lor  sufficiency,  lower  semi-continuity  ,,f  e  is  //,,/  required  In  this  paper  we  are 
primarily  concerned  with  the  sufficiency  p.irt  of  this  theorem,  but  it  should  be  noted  that  in  |4| 
the  lower  semi-continuity  I  optimal  value  functions  of  MIMMs  was  established  under 
rationality  assumptions  on  the  coefficients  of  the  MINIM 

It  might  also  be  noted  that  the  argument  used  in  the  proof  can  be  used  to  show  that  e 
does  mu  have  a  PI  convex  envelope  if  e<<>  ,)  =  t  °  or  g  <««  ..)  =  since  this  would  mean 

that  e'to  a  <  gto,,)  or  e'to  i  <  gto  )  for  any  PI  function  g'.  On  the  other  hand,  a  PI.  con¬ 
vex  envelope  may  exist  if  there  are  mivnot  points  x  of  |o,„o  |  with  the  properly  that 

gt  x  )  =  I  fus  allows  the  domain  ol  g  to  have  "gaps"  on  which  g  may  he  thought  of  as  being 

•  Such  gaps  often  occur  in  optimal  value  funefions  of  MINI  M’s 

f  ram  figure  1.  one  might  conjecture  that  A'  is  the  convex  envelope  of  k  on  K  [  lhis  is 
indeed  true,  and  m  Section  2  we  will  use  the  approach  of  I  heorem  I  to  establish  a  more  gen¬ 
eral  result  from  which  this  follows  as  a  special  case 

2.  Tin:  l  NBOl  NDFI)  (  ASF 

In  this  section  we  will  consider  MIMM  s  lor  a  broad  class  ot  economy -ot -scale  functions 
that  includes  the  economy -of -scale  function  Af\i  of  the  previous  section  Spevitic.il  I  v .  we  will 
develop  relaxation-optimal  MIMM  s  lot  the  class  of  functions  whose  elements  may  be 
represented  as  optimal  value  functions  ot  the  following  type 


(2  11 


/,( v  i 


min  i  i 


s.i.  <:}  ^  v 

i  >  0,  v,  integer  for  /  €  /. 

where  i  =  < i  | . i  ,,)>  0.  </=(</ , . a,.)  >  0, 

anil  /  is  a  subset  of  1 1 . n). 

(  Ihe  ease  in  which  there  are  <  =  0  is  not  ot  economic  interest,  but  is  included  for  mathemati¬ 
cal  completeness  I  he  sign  restrictions  on  <  and  </  do  serve  to  guarantee  the  existence  of  an 
optimal  solution  for  all  v.  hut.  as  shown  in  Appendix  A.  could  be  replaced  by  this  hypothesis 
In  the  next  section,  where  hounds  on  the  i  are  assumed,  it  will  be  seen  that  these  sign  restric¬ 
tions  have  greater  significance  I  Note  that  the  class  of  functions  representable  in  the  form 
(2  It  includes  lixed-charge  functions  and  economy -ol-scale  functions  allowing  several  dilh'rcni 
volume  discounts  (as  opposed  to  only  one  in  the  case  of  A(vi)  (The  computational  results  in 
Section  s  deal  with  an  example  in  which  n  -  .1  l  for  notational  convenience  we  will  assume 
that  the  variables  have  been  ordered  so  that 

(2  2>  i  ,/it,  =  r,  $  t  ./  •  H  r  ■  4  ...  $  <•„/</„  =  r,  . 

from  a  cost  viewpoint,  l h is  means  lhat.  on  as  /nr  uml  basis,  (he  most  "economical"  purchase 
quantity  is  </,.  the  next  most  economical  is  a  >.  etc  .  and  the  right-hand  side  x  represents  the 
minimum  amount  to  be  purchased 
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(  onsidei  the  continuous  relaxation  of  the  MINIM  m  (2  I  ».  which  tins  the  optimal  xalue 
I  mu  i  ion  defined  hi 

(  2  .'I  /  ‘  1 1  x  i  =  min  i  i 

v  I  u  t  x .  t  e  II 

I  he  lolloping  lemma  slates  that  /*  is  linear  on  R],  and  pros  ides  the  basis  lor  a  proof  of  the 
lelaxation-opumality  ot  the  MINIM  on  the  RIISol  (2  1) 

I  I  M  N1 A  1  lor  a  t  IK  I .  / ;  (  x  )  =  r ,  x 

I’ROOf  Note  that,  lor  any  \  2  0,  the  dual  ot  (2  3l  may  be  written  as 
(24)  max  x  v 

s.t.  vo  u  i.  v  >  0. 

Ux  selling  i’  =  \  i/  ami  i  *  --  vt  =  ...  -  t  7  "  0  anil  x*  ■  /  .  xxe  obtain  primal  and  tlual  feasi¬ 
ble  solutions  with  common  objeetixe  function  value  r,.x.  Ibis  is  thus  the  optimal  value,  /  *<  v  I 

Maxing  obtained  a  closed  form  representation  of  /‘fxl.  the  relationship  between  t,  and  /*  is 
easily  established. 

INI OKIM  2  The  following  relations  hold  between  /.  and  /*: 

( 2  '  i  / .  (  a  I  =  J  *(,v  )  l  or  a  =  A  ■  a  i  ( A  =  0.  1  .  .  .  ) 

i  2  M  /  *  =  (*!/,.  IK  ! ) . 

I’KOOl  Since  /*<v  )  ^  /|(.v)  for  x  €  IR.1,  (2.5)  may  be  established  by  showing  that,  for 
x  A  ii  i  (A  =  0.1.2.  (2.1)  has  a  feasible  solution  with  objeetixe  function  xalue 

/ *t  Au j )  -  r,  •  Am |  =  Ai  |.  Such  a  feasible  solution  is  obtained  by  setting  y  i  =  A  and  i;  = 

t  ,  =  v,  =  0. 

To  proxe  (2.61.  it  suffices  to  show  that  for  any  v  €  IR).  there  exist  \|.  ,v>  €  R)  such 
that,  for  some  A  f  (0.  ll.  we  have  A  \q  +  (0  -  A  I  ax  =  a  and  ./*< v  )  =  A ,/ 1 <-V|  )  + 
(J  A  )  /  j  f  \  i ) .  since  any  /  6  t  *(/j.  IR.1 )  must  satisfy  ./ (a  )  ^  A /j  (.\|  )  +  (l-A)/j(.\0. 
these  quantities  are  obtained  by  taking  ,v|  =  0.  Ay  =  A  •  «|.  where  A  is  an  integer  chosen  such 
that  Am |  Sr  x  ,  and  A  such  that  ( 1  —  A  )Am|  =  v  Then  A  / , (a q  )  +  ( 1  -  A  )  / j  (aw  )  =  0  + 
l  1  A  I  r ■  |  Am  i  =  r  |  A  =  /  *(  a  ) . 

It  is  of  some  mathematical  interest  to  note  that  the  constraint  mi  >  v  in  (2.3)  is  satisfied 
as  an  equality  by  an  optimal  solution  of  (2.3).  The  observation  max  be  used  lo  establish  that  I* 
is  also  the  convex  envelope  on  IR  !  of  the  optimal  value  function  in  the  corresponding  equatiiy- 
i  unstrained  case 

(2.7)  /',  (a  )  =  min  u 

s  i.  mi  =  v 

t  2  0,  i  integer  for  i  c  /. 

I  his  result  follows  since  /jl\)  =  /f(v)  for  v  =■  A  a ,  ( A  =  l).  I  .  .  .  .  >  Since  t  *  may  be  writ¬ 
ten  in  the  form  (2  3)  with  the  constraint  mi  2?  \  replaced  by  mi  =  a.  it  follows  by  the  analog  of 
Theorem  2  that  the  modified  MIMM  is  relaxation-optimal  in  the  equality-constrained  case  (2.7) 

as  well 
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On  iIk  other  hand,  it  is  not  always  possible  to  establish  relaxation-optimality  il  a  positive 
i oimani  is  added  to  the  RMS  of  the  constraint  with  RMS  \  in  <2  I )  (negative  constants  pose  no 
difficulty.  as  we  will  show  in  Section  3)  All  example  illustrating  the  difficulties  that  may  arise 
in  this  ease  is  given  in  Appendix  B  However,  it  is  possible  to  extend  the  results  of  this  section 
to  the  case  in  which  nonnegative  hounds  are  imposed  on  the  variables  This  case  is  taken  up  in 
Section  3 

(  mails,  in  the  case  that  the  a  are  all  rational.  'theorem  I  is  a  special  case  ol  a  result  ol 
Blair  and  Jeroslow  |2|.  who  considered  a  system  of  constraints  and  showed  that  the  convex 
envelope  of  the  optimal  value  function  of  the  MIV1M  (for  v  f  IR  ) 

(2X>  miner 

s  t.  h  >  r  >  (I.  r  integer  for  i  /. 

coincides  with  the  optimal  value  function  of  the  continuous  relaxation  of  the  MIMVt  The 
thrust  of  the  next  section  can  thus  he  viewed  as  an  extension  of  this  result  to  certain  cases  in 
which  nonzero  < omtants  aie  allowed  in  the  constraints  of  12.8).  Iln  general  the  Blair-Jeroslow 
result  does  noi  extend  to  the  nonhomogeneous  ease,  as  may  be  ascertained  from  the  examples 
in  Section  B  ) 


3.  BOUNDS  ON  > 

l-'or  most  integer  programming  codes,  it  is  necessary  to  have  bounds  on  the  integer  vari¬ 
ables  If  the  range  of  the  i  variables  in  (2.3)  is  restricted  by  the  imposition  of  bounds,  then  the 
corresponding  optimal  value  function  on  IR .  is  pieccwise-linear  (where  it  is  Unite),  but  the 
relaxation-optimality  property  of  Section  2  may  nonetheless  be  extended  to  this  case  We  first 
consider  the  case  of  upper  bounds,  and  then  the  case  of  upper  and  lower  bounds  As  in  Section 
2  we  assume  that  c  >  0  and  a  >  0.  (By  making  some  obvious  extensions,  the  constraint 
u  >  0  may  be  removed,  but  as  may  be  seen  from  an  example  in  Appendix  B.  sign  restrictions 
on  i  are  needed  in  the  hounded  case  to  guarantee  relaxation-optimality.) 

Specifically .  instead  of  the  MIVf.M  in  (2.11  we  first  consider 
<3.1 )  /,(a  )  =  min  cv 

st  m  >  v 
0  <  v  ^  u 
y,  integer.  /  6  / 

where  c  0,  a  >  0,  the  ordering  assumption  (2  2)  is  assumed  to  be  satisfied,  and  the  u,  are 
nonnegative  constants  with  u,  integer  for  ;  6  /  To  prove  relaxation-optimality  we  will  show 
that  the  convex  envelope  of  /\  on  /)  =  lO.uul.  denoted  by  is  given  by  the  optimal 

value  function  of  the  continuous  relaxation: 

(3.2)  /*(.< )  =  min  cv 

s  l  ay  -?  v 
0  <  v  H  ii. 

(We  are  not  concerned  with  v  >  an  since  /  . ( v )  =  /*(.v)  =  +oo  for  such  a.) 
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lor  noiannna!  ion vemence  in  staling  .1  dosed  form  expression  for  /t(\).  we  make  the 
following  definitions. 

h  =  £  ii  h  .  </  ==  £  e  1#  ( /  ■  I) . n  ) . 

1 

where  11  is  understood  that  =  0  and  i/„  ---  0 

I  he  following  is  llie  analog  of  l  emma  I 

1.1  MM  A  2  /*,l.i  I  -  r  (  \  -■/>•+  </,  for  h  <  \  h  . , 

(./  =  0 . <1  I  I 

I’ROOL  The  proof  is  analogous  10  that  of  l  emma  I  l-or  any  v,  the  dual  of  <2  7)  is 
given  by 

max  i.v  -  wu 

s.l.  va  -  w  $  i  .  1  >  0.  u  ^  0. 

In  addition,  for  any  v  t  I).  the  optimal  solutions  of  the  primal  and  dual  problems  are  as  follows 
if  h  <  a  <  h  . set  1  *  -  11,  for  1  ^  ./.  set  .1  *,  =  I)  for  i  >  /  +  1.  and  choose  ,r  .  1  such  that 
t/i  *  =  a;  set  f*  =  x  .  |.  a  *  «=  r  .  1  a,  -  r,  for  1  $  /,  and  »•*  =  0  for  i  >  /. 

Note  from  l.emma  2  that  the  breakpoints  of  /  are  contained  in  the  set  |h„ . />„).  By 

applying  Theorem  I.  we  can  obtain  the  following  analog  of  Theorem  2: 

TIILORLM  .V  The  following  relationships  hold  between  /:  and  /V 

(33)  /i(.v I  =  /M.vl  if  v  =  ft,  (./  -  0 . //). 

(3.4)  ./*.'=  c  *(/:./2). 

PROOI-:  The  relation  < 3 . 3 )  follows  from  considering  the  feasible  solution  with  y*  =  11 
for  1  <  /  and  1 *  -  0  for  i  >  1.  The  relation  (3.4)  then  follows  directly  from  (3.3)  and 
Theorem  I 

In  a  branch-and-bound  algorithm  in  which  the  y,  are  used  as  the  branching  variables,  the 
formulation  <3.1 )  has  the  additional  very  nice  property  of  yielding  a  relaxation-optimal  formula¬ 
tion  at  cat  It  node  in  the  tree,  since  relaxation-optimality  is  not  affected  by  the  imposition  of 
additional  integer  upper  and  lower  bounds  on  the  1,  in  (3.1).  This  is  because  introduction  of 
nonnegative  lower  bounds  is  equivalent  to  the  addition  of  a  ncyatiw  constant  to  the  R 1  IS  of  the 
constraint  a\  ^  v  Since  a  constraint  of  the  form  ay  ^  ,v  -  y.  where  y  ^  0  implies  an 
optimal  value  of  0  for  v  r  [O.yl  in  both  the  corresponding  MIMM  and  its  relaxation,  it  is  easily 
shown  that  a  translation  of  variables  leads  to  the  following  result  (see  Appendix  C  for  details): 

COROLLARY  1:  Lor  v  ^  0.  let 
/  |( a  )  s  nnn  1 1 

\.l.  <n  ;?  \ 

I  1  <  11 


1,  integer.  1  t  /. 
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where  /  ■?  0  anil  /,  and  u  are  integer  lor  it  I:  then  the  M1MM  is  relaxation  optimal  on  mv 
interval  |o  .a it  I.  where  «►  t  |0.u/| 

In  the  next  two  sections  we  will  compare  these  results  to  a  "standard"  approach  to  formu¬ 
lation  that  yields  relaxation-optimal  MIMM's  for  quite  general  piecewise-linear  functions 

4.  AN  ALTERNATE  APPROACH 

A  standard  and  quite  general  approach  to  modelling  continuous  piecewise-linear  noncon- 
vex  functions  is  to  employ  the  so-called  "A  formulation"  of  separable  programming  with  the 
additional  restrictions  that  at  most  two  a,  are  allowed  to  be  positive  and  that  these  must  be 
"consecutive."  We  will  see  that,  while  this  approach  also  yields  relaxation-optimal  models,  it 
can,  in  contrast  to  the  approach  of  Section  3.  lead  to  computational  difficulties  in  the  absence  of 
special  provisions  for  handling  the  variables. 

Assume  that  /  is  a  piece  wise-linear  function  on  (o,,.o;i|  with  breakpoints  o(l  <  n,  < 

.  .  .  <  o,,.  Ut  is  possible  to  deal  with  l.s.c.  "piecewise-linear"  functions  by  a  slightly  different 
formulation  technique  (see  |4|).  hut,  aside  from  the  need  for  more  complex  notation,  the 
results  are  essentially  the  same.)  Consider  the  following  M1MM  for  /: 

r 

(4.1  >  /  (,v)  =  min  £  /'( o,)A, 

*  A.  , 

/’ 

v.r.  £  <1,A,  =  ,v 
r 

£  A,  =  1.  A,  ^  0  (I  =  0 . />) 

I  0 

A|)  ^  ft,, 

A  I  ^  fti,  4-  ft, 


A  (1  •  ^  ft  r  .  +  ft  „  I 
^  l 

/' 1 

£  ft,  ~  l.  ft,  ^  0  and  integer  U  =  0 . p  -  U 

i  -u 

and  let  /*  denote  the  optimal  value  function  corresponding  to  the  continuous  relaxation  ol  the 
RHSofU.l).  Note  that  ./*  e  (’(,/. 

THEOREM  4;  The  MIMM  on  the  RHS  of  (4.1)  is  relaxation-optimal  on  l<m. 

PROOF:  Let  v  <  and  let  A  be  chosen  so  that  /*(_\  >  is  obtained  by  setting  A,  =  A, 

in  the  corresponding  LPMM.  so  that  7*(.x)  =  £/(«■,)  A,  ^  *(._/,«, .lou.«p] •  A,  ^ 

i ’(/.  v.lu I .  Since  /*  t  ('</.  [<«„,«, ,1).  this  implies  that  / * <-x  )  =  i  *(./.. v,  (<*<>. <*,,!>  and  the 
conclusion  follows. 


While  Theorem  4  implies  that  the  standard  MIMM  will  also  be  relaxation-optimal  lor  a 
amimitniis  economy-of-scalc  function  in  the  class  considered  in  Section  3,  the  MIMM  <4.1 1  has 
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several  compulation.il  disadvantages  One  obvious  disadvantage  is  its  sheer  si/e,  since  the 
number  of  constraints  and  variables  m  <4  11  is  determined  by  the  number  of  breakpoints  of  /. 
whereas  this  is  not  the  case  for  the  lormulations  of  Sections  2  and  .V  A  more  subtle  disadvan¬ 
tage  is  the  failure  ol  the  integer  variables  ft  of  (4  ll  to  directly  reflect  physical  quantities.  In 
particular,  theft  all  have  vost  coellicients  ol  (I  and.  moreover,  a  0  "branch"  on  aft,  has  no  effect 
on  the  allowable  range  of  v  values  unless  it  has  the  largest  or  smallest  index  of  any  ft,  not  yet 
lived  W  hile  these  disadvantages  may  be  alleviated  via  the  use  of  "Special  Ordered  Set"  (SOS) 
strategies  for  branching  (see  |1|).  such  strategies  are  often  not  available  in  MIP  codes  (see  If]). 
In  particular.  SOS  strategies  are  not  fully  implemented  on  the  I'nivae  I'MPS-MIP  code  in  use  at 
the  Madison  Academic  Computing  (  enter,  and  in  the  next  section  we  compare  results  obtained 
with  IMPS  and  the  formulation  approaches  of  Section  3  and  4  (It  should  be  noted  that  the 
use  of  an  SOS  strategy  has  the  advantage  of  imposing  disjoint  upper  and  lower  bounds  on  the 
range  of  the  variable  v  in  (4  1  )  when  SOS  branching  is  performed.  Branching  on  the  i,  in  (31) 
imposes  upper  hounds  on  v,  but  does  not  directly  impose  lower  bounds  Lower  bounds  on  the 
tange  ol  v  may  be  directly  imposed  by  adding  to  (3.1)  constraints  of  the  form 

v  5  ay  i'i:. 

plus  additional  constraints  of  the  form  r  ^  t  .  By  selecting  the  coefficients  a  to  reflect  max¬ 
imum  "surpluses"  so  that  for  any  v  t  [(),</!/ 1.  a  f  yielding  an  optimal  solution  to  <3.1 1  for  a  =  v 
will  satisfy  v  >  ay  -  a:  for  some  feasible  r.  relaxation  optimality  wall  be  preserved  This  lol- 
lows  easily  from  the  fact  that,  by  assumption,  (he  optimal  value  function  of  the  MIMM  remains 
M.v  ).  while  the  optimal  value  of  the  continuous  relaxation,  which  cannot  increase  beyond 
i  *<  t  .  I  >  (in  spite  of  the  added  constraint)  must  also  remain  the  same  Some  theoretical 
and  computational  aspects  of  such  lower  hound  constraints  as  well  as  some  oi.ier  modelling 
refinements  to  deal  with  upper  bounds  on  \  are  currently  under  investigation  > 

5.  A  COMF1  r.VHONAI.  COMPARISON 

In  this  section  we  consider  a  comparison  of  solution  times  for  different  formulations  ol 
the  following  communications  network  problems,  determine  the  minimum  cost  network  (see 
fable  1)  that  meet  specified  demands  (see  Table  2)  between  six  distinct  pairs  of  cities  (  A.B). 
<A.C).  (A.D).  (B.O.  (B.D).  and  (CM)),  where  the  communication  traffic  between  the  ele¬ 
ments  of  a  city -pair  may  be  routed  via  any  acyclic  path  between  the  cities  (there  are  5  such 
routes  between  each  city-pair) 


IABLI  1  (  Os  t.s 


;  Arc 

Single  Channel 

1  2  Channels 

60  Channels 

1  A-B 

784.?s 

7028.77  ^ 

17690.40 

B-C 

87X  2a 

7442.07 

21341.47 

(  -1) 

1407  70 

13232.38 

42512.54 

!)-  A 

634  40 

5647.63 

13098.00 

D  B 

1 04s  60 

4619.52 

28022.08 

(  A 

1  1 236  7 7 

i  . J 

11500.10 

35860.53 

I  \Bl  l  2  Two  Sets  of  Communications  Demands 


(Tty-  Pai  r 

Demand  Set  1 

Demand  Set  II 

A-B 

2 

4 

B  ( 

10 

10 

(  1) 

46 

64 

DA 

5 

5 

I)  B 

2 

10 

(A 

|  4 

14 
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Algebraically,  this  problem  has  the  form 

»« 

mm  £  h, (  v, ) 

'  .  i 

s  t  £  j  .  =  tik  ( A  =  I  .....61 

i 

X  :,k  =  >,(;=  I . 6) 

HI,  i 

V  0. 

where  ;,k  represents  the  number  of  channels  on  the  path  between  the  A!"  city-pair.  r/,  is  the 
total  number  of  channels  needed  by  the  k'"  city-pair,  .4,  is  the  set  of  pairs  (/.A  )  such  that  the 
corresponding  path  uses  arc  i.  \  is  the  total  number  of  channels  on  arc  i.  and  h  tx,)  is  the 
minimum  cost  of  leasing  at  least  v  channels  on  arc  /.  (Note  that  the  /;,  are  economy -of-scale 
functions  of  the  type  considered  in  Sections  2  and  3  with  n  =  3.  For  computational  conveni¬ 
ence  the  variables  associated  with  single  channels  on  arcs  were  assumed  continuous.  Because 
of  the  fixed  demands,  bounds  could  be  imposed  on  all  variables.  General  integer  variables  were 
decomposed  into  0-1  variables,  since  the  FMPS-MIP  code  requires  this. ) 

The  computational  results  of  Table  3  illustrate  the  dramatic  difference  in  solution 
behavior  and  times  between  the  formulation  approaches  of  Sections  3  and  4  The  M1P  code 
used  was  the  l.'nivac  FMPS-MIP  code  (level  7RI)  and  the  problems  were  run  on  the  Madison 
Academic  Computing  Center  linivac  1110.  Tor  demand  set  I,  the  Section  3  formulation 
requires  only  about  1/4  the  computer  time  of  the  Section  4  formulation.  For  demand  set  11. 
the  solution  time  for  the  Section  3  formulation  is  15  seconds,  whereas  the  FMPS  system  was 
unable  to  solve  the  Section  4  formulation.  Similar  behavior  was  observed  in  runs  using  a 
locally  developed  MIP  code.  IPMIX1).  w'hich  successfully  solved  both  l-S  and  ll-S.  but  failed  to 
solve  either  l-L  or  1 1  -  L  because  of  storage  overflows. 


TABl.F  3  Problem  Size.';  and  Solution  Times 


Problem 

Rows 

Columns 

0-1  Variables 

Solution  Time  (Sec.) 

l-S* 

12 

54 

fs 

4 

I-L+ 

76 

122 

40 

15 

II-S 

12 

60 

24 

15 

1 1  -  L 

116 

202 

80 

t 

‘I  denotes  dcm.ttn!  set  I.  S  denotes  “short"  lurin'*1  Mum 
I  denotes  "Imig"  standard  h  mi  hi  non 
t  I  All’S  svstem  I’m  ced  termination  <*t  run  mill  message  "mimenc.il  errors" 


A  number  of  other  versions  of  the  problems  were  run  in  which  some  of  the  cost  function 
terms  were  modelled  via  the  Section  3  approach  and  the  remainder  via  the  Section  4  approach. 
In  all  cases  the  results  were  worse  then  those  obtained  via  the  Section  3  approach. 


6.  CONCLUSION 


For  piecewise-linear  functions  belonging  to  a  broad  class  of  economy-of-scale  functions,  a 
compact  mixed-integer  programming  formulation  has  been  described.  This  formulation  was 
then  shown  to  behave  at  least  as  well  as  any  other  mixed-integer  formulation  of  the  function  in 


i:s 
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terms  i't  i ho  approximation  error  resulting  from  the  relaxation  of  integrality  constraints.  More- 
oxer.  a  computational  comparison  (using  a  communications  network  problem  as  a  test  problem) 
showed  the  superiority  of  the  compact  formulation  over  a  standard  mixed-integer  formulation 
ol  the  same  problem 
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APPENDIX  A 

Jo  justify  the  statement  in  Section  2  that  the  restrictions  a  >  0  and  <  >  I)  can  be 
replaced  by  assuming  that  (2.1)  has  an  optimal  solution  for  v  >  0,  we  consider  the  remaining 
cases  ( I )  /  such  that  c,  ^  0  and  a  $0(21  /  such  that  t.  <  0  and  a  >  0,  and  (3)  /  such  that 
<  <  0  and  a  <  0. 

CAST.  I:  for  those  i  such  that  i,  ^  0  and  a  $  0.  one  may  obtain  an  equivalent  problem 
by  deleting  the  corresponding  variables  i  from  the  problem,  since,  for  any  \  >  0.  an  optimal 
solution  may  be  obtained  in  which  such  y.  =  0 

(  ASP.  2  If  there  are  /  such  that  i  <  0  and  u,  >  0.  then  clearly  the  objective  function  of 
(2  D  must  be  unbounded  from  below,  so  this  case  is  ruled  out  by  the  existence  of  an  optimal 
solution 

CAST.  3  If.  for  some  i.  <  <  0  and  a,  <  •).  then  either  all  a  $  0,  in  which  case  (2  1)  is 
infeasible  for  \  >  0.  or  there  exists  at  least  one  /  such  that  u,  >  0  In  the  latter  case  let 
r'  ~  min  Icy/u,  lu,  >  ()|  and  r  H  max  !(,,/</» '<i  <  0.  u(  <  ()|  If  r  $  r',  then,  assuming 
that  the  variables  are  ordered  so  that  a[  >  0  and  c, /<i,  =  r ' .  it  may  be  seen  from  obvious 
extensions  ot  the  proof-  of  Lemma  I  and  Theorem  2  that  the  desired  result  holds  On  the 
other  hand,  if  r  >  r',  then  the  objective  function  of  (2  1)  is  unbounded  from  below  for  all  v 
This  follows  by  letting  r‘  =  <  ,/U|  and  r  =  noting  that  c ,/-c,,  <  «|/-a(1,  and  choosing  a 

rational  «  >  0  such  that  <  <  «  <  cit/  -ap.  from  which  it  follows  that  a,  +  a,,«  >  0  and 

i  v  < <  0  Now  choose  an  integer  M  >  0  such  that  is  integer  and  note  that  the  rela¬ 
tions  a  ff  *■  a,,  \1H  >  0  and  <  ,.W  +  M0  <  0  imply  unboundedness 
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APPENDIX  B 

Here  we  consider  several  examples  to  illustrate  the  difficulties  that  can  arise  when  one 
attempts  to  extend  the  results  of  Sections  2  and  3  by  either  (I)  inserting  a  positive  constant  on 
the  RU-S  of  the  constraint  involving  x.  or  (2)  relaxing  sign  restrictions  in  the  bounded  case,  or 
(3)  allowing  more  than  one  constraint  involving  \  in  the  bounded  case 

The  following  illustrates  the  difficulties  that  may  arise  when  a  positive  eonstam  appears  in 
the  RUS  of  a  Ml  MM  (see  figure  4): 

A  i  < -V  )  =  min  r ,  t  I  Or  > 

s.t.  r |  +  1  2  r  -  \  +  10 

r, .  r i  ,5  0.  y,  integer. 

In  this  case,  the  convex  envelope  of  A  ,  ( ,v  >  on  R  !  is  easily  seen  to  have  a  value  of  It)  on  10,21 . 
so  that  it  does  not  coincide  at  x  =  0  with  the  optimal  value  function  of  the  continuous  relaxa¬ 
tion  of  the  MIMM  as  given  by: 

k  *(.v )  =  min  i  i  i  10m 

s. t.  \  |  +  I  2r  ■  ^  v  +  10 


since  A‘,(0)  =  10  •  -jy  <  10. 


I  ii .1  m  4  A  ,( >  i  mi  |ll,  1 4 1 


It  K  Mi'll  K 


I  'll 


Note  also  i hat  ihe  addition  of  hounds  dues  not  help,  since  defining 
k  A\)  =  nun  y,  +  lOy . 

s.t  y,  +  I2y,  >  v  +  10 
0  ^  y,  «£  10 
0  $  y,  s?  1 
y>  integer 

fields  A  | ( V )  =  f  .l.vl  for  v  C  [0,  121.  and  A,(x)  coincides  with  its  convex  envelope  on  (0,121, 
whereas  the  optimal  value  function  of  the  continuous  relaxation  is  again  strictly  less  than  A  ,(.v) 
at  v  =  0 

Now  consider  the  following  example  in  which  a  RHS  constant  is  not  present  in  the  con¬ 
straint  involving  a,  but  there  are  negative  coefficients: 

A, (a)  =  min  — y,"  +  10y. 

'  i  1 

s.t.  -y  |  +  12y,  >  ,v 
0  <  y  |  t?  10 

0  ^  y,  «S  1 
y  >  integer 

Making  the  change  of  variables  y[  —  10  —  y,  we  have 
At(.v)  =  -10  +  min  y,  +  10y. 

s.t.  y i  +  1 2y '  2?  a  +  10 
0  ^  y,  ^  10 
0  <  y  i  j?  1 
y,  integer, 

so  that  A  ifv  )  =  -  10  +  A  .(a  ).  It  is  easily  seen  that  while  A  (  coincides  with  its  convex  envelope 
on  (0,121.  it  differs  from  the  optimal  value  function  of  the  corresponding  continuous  relaxation 
at  v  =  0 

In  our  last  example,  we  consider  the  case  of  two  constraints  with  positive  coefficients  and 

rhs  v: 

Aj(a  )  =  min  >  |  +  y. 

s.t.  2y i  +  4c.  ^  a 
4y,  +  3y.  ^  a 
0  sS  y,.  y '  <  1 
V  i.y.  integei 


S' 
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In  this  case  the  optimal  value  function  is  finite  for  x  $  6.  and  is  easily  seen  to  have  the  values 
(I  for  v  =  (I 

A  4 ( v  )  -  1  for  0  <  v  ^  3 
2  for  3  <  x  ^  6 

Thus,  the  convex  envelope  of  A4  on  [O.hl  is  simply  v/3  On  the  other  hand,  lor  x  =  3  the  con¬ 
tinuous  relaxation  of  the  above  MIMM  for  A,  is  easily  seen  to  have  optimal  value  3/2  for  x  =  5 

(choose  C|  =  — .  y  >  =  I),  and  therefore  it  does  mu  coincide  with  the  convex  envelope,  which 
has  value  5/3  at  x  =  3 

appendix  < 

We  wish  to  establish  relaxation-optimality  in  the  case  of  both  upper  and  lower  bounds  as 
considered  in  Corollary  1  Define 

(C.  1 )  /,(.v  )  =  mm  ci¬ 

st.  ay  ^  ,x  . 

/  ^  I  <  u 
y,  integer,  /  €  / 
and 

(C.2)  /*(  v  )  =  min  cy 

s.t.  ay  >  v 
I  ^  y  ^  a . 

where  /  >  0  and  /,  and  it,  are  integer  for  i  €  /.  By  making  the  substitutions  y  =  r  +  /, 
x  =  i  +  al.  and  Cl  =  u  —  /,  we  have 

1 1(  x  I  =  cl  +  mm  c: 

s.t.  az  ^  l.  0  <  z  ^  0,  z,  integer,  i  €  / 

=  cl  +  /?(r)  =  cl  +  fAx  -  al). 

where 

I  At )  ~  min  tz 

s.t  az  ^  t.  0  <  ;  ^  «,  z,  integer,  /  e  /. 

Similarly  /fix)  =  cl  *■  i'Ax  -  al)  where 
t'  An  --  nnn  cr 

si  a:  ^  i,  (K  :  ^  u. 
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ABSTRACT 


In  lh^  work  we  consider  spread  of  informalion  which  motivates  rhc  hearer 
lo  perform  some  specified  action  I  he  lime  lo  complelion  of  an  action  is  as¬ 
sumed  to  be  a  random  variable  and  the  main  focus  is  on  the  number  of  com¬ 
pleted  actions  by  time  /.  Vtrl  Some  models,  which  reflect  different  degree  of 
centralization  in  the  spread  process,  are  analyzed  and  the  distribution  of  Vf/f, 
as  well  as  that  of  some  other  stochastic  processes  ol  interest,  are  obtained  The 
relevance  to  propagation  of  epidemics  is  pointed  out 

All  models  are  solved  by  employing  two  interrelated  concepts,  namely,  the 
order  statistics  property  of  stochastic  processes  and  the  binomial  closure  proper¬ 
ty  of  collections  of  distributions  In  this  respect,  the  work  also  serves  as  an 
illustration  of  the  application  of  these  useful  concepts 

I.  INTRODUCTION 

In  this  work  we  shall  consider  several  spread  of  information  models.  While  the  term 
information  is  meant  in  a  broad  sense  we  are  particularly  referring  to  messages  which  motivate 
the  hearers  to  perform  some  specified  action.  This  could  be  a  marketing  leaflet  which  stimu¬ 
lates  the  reader  to  buy  some  commodity  or  a  military  call  up  order  which  requires  the  report  of 
its  recepienl  at  some  predetermined  place.  The  spreading  itself  could  be  carried  out  by  a  single 
spreader  (possibly  a  source),  by  means  of  a  hierarchy  of  spreaders  or  by  anyone  who  has  heard 
the  information.  The  models  which  will  be  discussed  in  this  work  corresponds  to  this  varying 
degree  of  centralization  in  the  spread  process. 

All  models  start  with  a  single  initial  spreader  —  having  more  than  one  would  merely 
require  convoluling  the  results  —  and  the  spread  rate  is  always  of  a  homogeneous  Poisson  type. 
The  time  to  completion  of  the  specified  action  is  assumed  to  be  a  random  variable,  independent 
from  hearer  to  hearer,  with  a  general  cumulative  distribution  function  //(•).  It  should  be  noted 
that  an  action  need  not  involve  physical  efforts  and  may  even  be  instantaneous  so  that  //(•)  is 
indeed,  the  e.d.f.  of  the  period  of  time  elapsed  between  the  receipt  of  the  information  and  the 
completion  of  the  action 

The  quantity  we  are  mainly  interested  in  is  the  number  of  hearers  who  have  completed 
the  action  by  time  /  or  alternatively,  the  number  of  completed  actions  by  time  I.  Besides  com¬ 
puting  the  distribution  of  this  stochastic  process  we  shall  also  obtain  the  distribution  of  associ¬ 
ated  stochastic  processes  of  interest  such  as  the  number  of  hearers  up  to  time  t  or  the  number 
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of  responsive  spreaders  or  hearers  up  lo  lime  :  (when  the  possibility  of  "defection"  is  taken  into 
account  I 

It  is  instructive  to  note  that  the  above  models  bear  relevance  to  propagation  of  epidemics. 
I  he  vocabulary  should  then  be  translated  as  follows:  Information-Disease.  Spreader-Carrier. 
Hearer-Infectious.  Source-Virus  The  specified  action  could  be  interpreted  as  any  event  of 
interest  such  as  recovery  or  the  less  fortunate  outcome 

for  literature  on  spread  of  rumors  see  Diet/  |3|  and  Bartholomew  |2|.  A  comprehensive 
treatise  on  spread  of  epidemics  can  be  found  in  Bailey  111 

2.  SOME  PRELIMINARY  RESULTS 

Let  us  first  present  two  concepts,  which  we  shall  use  extensively  in  the  sequel. 

DEFINITION  1:  A  stochastic  process  with  unit  jumps,  fit),  is  said  to  have  the  order 
statistics  (abbreviated:  OS)  property  if  conditioned  on  fit)  =  ti,  the  unordered  times  of  jumps 
are  distributed  as  a  random  sample  of  size  11  from  a  c.d.f.  /■',(•)  which  we  shall  call  the  kernal 

c.d.f. 

NOTH.  In  this  work  we  shall  consider  only  processes  with  continuously  distributed  "inter- 
jump"  intervals  so  that  /-.(•)  will  always  be  a  continuous  function. 

DEFINITION  2.  A  collection  of  discrete  nonnegative  distributions is  said  to  be  binomi- 
ally  closed  (abbreviated:  BO  if  for  every  P  6  ^and  any  0  <  y  ^  1  there  exists  a  P  6  t^such 
that 

V  —  P.  Aj\,„  ~  Binomial  In,  y)  — >  X  —  P 

or.  restated,  if  V  is  distributed  according  to  a  member  of  £^and  the  conditional  distribution  of  X 
given  \  =  n.  is  Binomial  with  parameters  in.  y),  then  the  unconditional  distribution  of  X  is 
also  a  member  offijP 

Of  particular  interest  are  collections  which  are  parametric  families  of  distributions  depend¬ 
ing  on  some  parameter «  In  this  case  the  above  definition  can  be  reworded  as  follows: 

DEFINITION  2’:  A  parametric  family  of  distributions  =  \PU,H  €f)|  is  said  to  be  BC  if 
for  every  H  €  H  and  any  0  ^  y  ^  I  there  exists  a  »  €  <■)  such  that 

A  —  V  —  Binomial  in.  y)  — *  X  —  P 

where  H  —  HW ,  y ) 

The  function  Wlh.  y)  will  be  called  the  transformation  function. 

Examples  of  uniparameiric  BC  families  of  distribution  are: 

1.  The  Poisson  family  of  distributions 

PJx  )=<'"  —  ,  v  =  0.  1,  2.  ...,«€  H  =  10,  00). 

.v ! 


In  this  case  the  transformation  function  is 
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(I)  HW,yi  =  Hy. 


2.  The  Binomial  family  of  distributions 

/>,(*)  =  -«)*  v  =  0 . K :  fi  €  M  =  |0. 11. 


Here  again 

(2)  »W.y)  =  #y. 


3.  The  Geometrie  family  of  distributions 

P„(x)  =  ti(  \-H)\  x  =  0.  1,  2.  .  ..  ;  H  €  ©  =  10,1], 

Here 

(3)  0(0.  y)  =  11  +  yW  1  -  1)1 


A  useful  tool  for  verifying  whether  a  particular  collection  of  distribution  is  BC  is  provided 
by  the  following  characterization  theorem. 

PROPOSITION  1:  Let  ^be  a  collection  of  nonnegative  discrete  distributions  and  let  S^be 
the  corresponding  collection  of  moment  generating  functions  where  the  m.g.f.  associated  with  a 

distribution  P  is  given  by  G(:)  =  £  z'P(x).  Then^isBC  if  and  only  if  disclosed  under  a 

v-0 

linear  transformation  of  its  independent  variable,  i.e.,  for  every  P  €  fy  and  any  0  <  y  <  1 
there  exists  a  PE  ^Psuch  that 

G(y:  +-  1  -y)  =  G(z ) 

where  GIG)  is  the  m.g.f.  associated  with  P(P). 

The  proof  of  this  Proposition  is  straightforward.  When  dealing  with  parametric  families  of 
distributions  we  have  the  equivalent: 

PROPOSITION  T:  A  family  of  nonnegative  discrete  distribution  ty  =  (P«,0  €  ©1  is  BC  if 
and  only  if  for  every  t)  and  any  0  y  <  1  there  exists  a  0  €  ©  such  that 

(4)  G'„(yr  +  1  —  y)  =  G«(z) 

where  G„  is  the  m.g.f.  associated  with  P». 

Due  to  the  one  to  one  correspondence  between  distributions  and  m.g.fs,  the  transforma¬ 
tion  function  0(0.y)  is  the  same  function  in  both  collections. 

COROLLARY:  If  the  collection  <^is  BC  then  the  collection  tf1'',  formed  by  taking  the 
x-th  convolution  of  each  member  of  ty.  is  BC  too.  The  assertion  is  valid  not  only  for  positive 
integers  x  but  for  any  positive  real  x  for  which  there  exists  a  corresponding  collection  ($'x'  of 
proper  m.g.fs.  In  the  parametric  family  context  we  state  that  if  ^  *  \P».  0  €  W|  is  BC  then  so 
is^'4’  =  l Pi'' .9  €  ©1  where  P is  the  x-th  convolution  of  P„  with  itself.  In  this  case  the 
transformation  function  #(#,y)  remains  invariant  under  the  operation,  i.e.,  it  is  independent  of 


x. 
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The  corollary  follows  Immediately  from  Proposition  I  (or  D  due  to  the  multiplicative 
property  of  m.g.fs. 

The  following  proposition  relates  the  two  concepts  of  OS  property  of  stochastic  processes 
and  BC  property  of  collections  of  distributions. 

PROPOSITION  2:  If  a  stochastic  process  Nit),  with  .V ( 0 )  =  0,  processes  the  OS  pro¬ 
perty,  then  the  collection  of  distributions  of  Nil),  t  ^  0: 

^  0) 

is  BC 

PROOF:  From  the  OS  property  of  Nil),  we  can  conclude  that  PiNis)  =  j/Nit)  —  n) 
=  |”j  Fjis)  (I  -  F,is))n  for  all  0  ^  s  ^  t  and  all  integers  0  ^  j  $  n  (where  0"  =  1). 
Hence, 

PiNis  )=  ./)  =  £  (^]  F,’is)  ( 1  -  F"  ’is))  Pi  Nit)  =  n).  j  =  0.  1,  2 . 

n  -  / 

Multiplying  both  sides  by  and  summing  over  j  from  0  to  00  we  obtain  after  some  manipula¬ 
tions 


=  G\ ,,,(r  F,is)  +  I  -  F,(s)) 

oo 

where  G\,,,(r )  =  £  Pi  Nit )  =  n  ),  is  the  m.g.f.  of  the  distribution  of  Nit). 

rJ-l) 

Hence,  for  every  t  ^  0  and  for  any  0  <  y  <  1  there  exists  an  s(0  <  s  <  r),  such  that 
G\,sl(r)  =  Cvl„(:y  -f  1  -  -y), 
which  is  the  solution  of  equation 
(5)  F,is)  =  y. 

Such  a  unique  solution  does  exist  since  F, is)  is  continuously  increasing  from  0  to  1  in  the 
interval  [0,t]  Proposition  2  now  follows  from  Proposition  1. 

We  are  now  in  a  position  to  state  the  main  theorem. 


PROPOSITION  3  In  an  information  spread  process  (of  the  type  described  in  the  Intro¬ 
duction)  let  Yit)  be  the  number  of  hearers  who  initialed  an  action  up  to  time  rand  let  Xit)  be 
the  number  of  completed  actions  by  time  t.  Then,  if  the  stochastic  process  Yit)  possesses  the 
OS  property,  the  distribution  of  T(r)  belongs,  for  all  t  >  0,  to  the  collection  ^  = 
!  ^1  im*  r  5*  o). 


PROOF:  Assume  that  Yit)  =  n.  Then,  since  Yit)  possesses  the  OS  property,  the  unor¬ 
dered  points  of  time  at  which  the  n  hearers  received  the  information  are  distributed  as  a  ran¬ 
dom  sample  of  size  n  from  a  c.d.f.  F,iu).  Moreover,  the  probability  that  a  hearer  who  got  the 
message  at  time  uiu  <  r)  will  complete  the  action  by  time  r  is  Hit  —  u).  Combining  these 
two  facts  we  have 


Xit) 


—  Binomial  in,p) 


where 
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(61  p  =  J  f/  (t  -  u  )  dl\ In  ) 

Now.  by  Proposition  2,  the  collection  jf>  =  | ^  0)  is  BC  and  hence,  by  the  very 
definition  of  this  property,  the  distribution  of  XU)  belongs  to y/’as  well 

3.  HIERARCHIC  AL  SPREADING 

We  begin  with  a  simple  model  in  which  a  single  spreader  circulates  a  piece  of  information 
according  to  a  Poisson  process  with  parameter  A  i.e.,  the  "interhearing"  times  are  exponentially 
distributed  with  parameter  A.  Upon  receiving  the  information,  -oy  hearer  initiates  an  action 
whose  time  to  completion  is  distributed  according  to  a  general  c.d.f.  III  ).  Il  is  assumed  that  an 
action  can  be  initiated  only  when  the  information  (which  could  be  a  leaflet  or  a  form)  has  been 
received  directly  from  the  initial  spreader. 

By  assumption.  Nil)  is  a  Poisson  process,  viz., 

.V  It )  ~  Poisson  (A  i ). 

It  is  well  known  that  a  Poisson  process  possesses  the  OS  property  with  a  kernal  c.d.f., 

F,(u)  =  -  ,  0  <  u  «Sr. 

J 

so  that  by  Proposition  3  the  distribution  of  XU)  belongs,  for  any  t  ^  0.  to  the  collection  YP  = 

{ P\  (,1,  i  ^  0).  This  collection,  however,  is  identical  with  the  Poisson  family  of  distributions 
and  therefore,  by  ( 1 ), 

(7)  XU)  —  Poisson  |a  J  H(u)du\ 
since  here 

D  =  Ki  and.  by  (6).  y  =  p  =  —■  H!u)du. 

Thus, 

(8)  £  1  .V  ( r )  I  =  A  f '  H(u)du 

•zii 

and 

(9)  Gy,,,lz )  =  exp  [—A  ( 1  -  j )  J  HI  u  id’/J . 

Let  us  now  drop  the  assumption  that  all  hearers  do  act  and  introduce  a  probability  a  for  a 
hearer  to  be  responsive  and  perform  the  action.  The  number  of  responsive  hearers  up  to  time  i 
YU),  is  again  a  Poisson  process  with  parameter  Aa  which  enables  us  to  repeat  the  above  argu¬ 
ments  with  ha  instead  of  A  Therefore,  by  17). 

XU)  —  Poisson  (»ii  f  //(«)</uJ 


A  natural  extension  of  the  above  single  spreader  model  is  achieved  by  designating  some 
of  the  hearers  as  spreaders  These  spreaders,  however,  do  not  perform  the  action.  Specifically, 
we  have  an  initial  spreader  who  begins  at  time  0  to  circulate  the  information  among,  what  we 
shall  call,  second  generation  spreaders  These  spreaders  pass  on  the  information  to  hearers  who 
perform  the  action  All  spreading  is  done  according  to  a  Poisson  process  with  parameter  A. 
The  total  number  of  completed  actions  by  time  r.  can  be  expressed  as 

Sin 

.*\<r>  -  X  X,  ,U) 


V 
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.S'  ( / 1  is  the  number  of  second  generation  spreaders  who  have  received  the  information  by  time 


.V,  ,(/)  is  the  number  of  completed  actions  up  to  time  /  by  hearers  of  the  /-th  second  generation 
spreader.  By  assumption. 

•S'(/ )  ~  Poisson  (X  i ) 


and  hence,  by  the  OS  property  of  the  Poisson  process. 


-  I  —  X,  °'o  <’  *' 

M-O  ^ 

=  exp  |— X  /  +  X  G'vi,  i(r  > t/v] . 


w  (Mr 


Substituting  (9)  into  (10)  yields. 


,  ,  .  C‘  x  1 1  -if  HUiUI 

G|  )  =  exp  -X  i  +  X  J(|  e 


dGx,x,Az)  r, 

(12)  £[*,(/>]  = - ; -  =  X 5  j  (/  -  u)H(u)du. 

d z  ■  .  i  •'u 


The  total  number  of  people  who  know  the  information  by  time  /  (including  second  gen¬ 
eration  spreaders)  can  be  represented  as 


NAD  =  £  (/Vtll(r)  +  1) 


where. 


2V, , ,  ( / ) is  the  number  of  hearers  of  the  /-th  second  generation  spreader,  up  to  time  /. 

Noting  that  the  m.g.f.  of  N,,,(D  is  obtainable  from  the  m.g.f.  of  T, ,,(/).  by  setting  H(u)  =  1 
(u  -S  0),  and  since  G’v,  ^i  +  ife)  =  :G*  the  m.g.f.  of  AM/)  can  be  shown  to  be 

(13)  <7  v )  =  exp  -X/  +  (1  -  e  A,u  '’> 


with  NAi ) 


£'(A(2(/)]  =  X/  + 


If  the  possibility  of  "defection"  is  taken  into  account  and  we  let  I  -  fi  be  the  probability  that  a 
second  generation  spreader  does  not  spread  and  I  —  a  be  the  probability  that  a  hearer  does  not 
perform  the  action,  then,  repeating  the  above  arguments,  we  obtain 


/  ,  ,  ,,  ,  „  f  '  A..U  .‘if  //(ll>,/ll  . 

6,  ,„,<r)  =  exp  -X/i/  +Kf)  J(i  e  J"  dv 


"V 
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f:l.\\<r)l  =  A-’rr/l  f  (I  -  u)HUi)du. 

J  n 

Wc  now  proceed  to  consider  a  general  spreading  hierarchy.  Thus,  in  a  structure  of  order 
A,  the  process  starts  al  time  0  with  an  initial  spreader  who  circulates  the  information  among  the 
second  generation  spreaders,  who  pass  it  on  to  third  generation  spreaders  and  so  on  until  the 
A-th  generation  spreaders  spread  the  message  through  the  rest  of  the  population  who  perform 
the  action.  All  spreading  is  assumed  to  be  according  to  a  Poisson  process  with  parameter  A  and 
the  time  to  completion  of  the  action  has  a  c.d.f.  //(•).  Spreaders  do  not  perform  the  action. 

In  order  to  obtain  the  distribution  of  .VA  < / )  (the  index  A  denotes  the  order  of  the  spread¬ 
ing  hierarchy),  we  first  make  the  observation  that  a  second  generation  spreader  replicates,  with 
regard  to  his  branch,  the  role  of  the  initial  spreader  for  a  structure  of  order  A  -  1.  Hence, 
using  once  more  the  OS  property  of  the  Poisson  process,  we  obtain  the  recursive  equation 

(14)  G\4  |(,,<c)  =  exp  J— A i  +  A  G’f^,  ,(c)c/rj.  A  =  2.  3.  4,  . . . 

where  G (.,,,(;)  is  given  by  (II). 

Taking  the  derivative  of  (14)  with  respect  to  ;  and  setting  r  =  I.  yields  a  set  of  recursive 
equations  for  the  expectations  of  Xkit)  (A  =  2,  3,  ...).  Solving  these  equations,  while  recal¬ 
ling  the  initial  value  AT  A \(r))  in  (12),  we  find 

05)  ATA,</)|  =  77—77  f  ('  -  v)A  'H(vidv.  A  =  2.  3 . 

(In  fact,  both  (14)  and  (15)  also  hold  for  A  =  1  which  represents  a  single  spreader  model.) 

For  small  /,  a  higher  order  of  the  spreading  hierarchy  would  not  necessarily  increase  the 
expected  number  of  completed  actions  — since  spreaders  do  not  perform  the  action  — but  for 
larger  t  this  will  be  the  case.  When  i  tends  to  °°  it  can  be  shown,  using  an  Abelian  argument 
on  Laplace  transforms,  that 

£T l A"a  .,</)!  -  AT ( r )  1  —  <»,  for  any  finite  A. 

I  —■ » 

Similar  arguments  with  respect  to  Nk(t)  —  the  total  number  of  people  who  know  the  informa¬ 
tion  by  time  t  (including  spreaders),  yield  the  recursive  equation 

(16)  G\(  )  =  exp  |-A  /  +  k;  J  A  =  2,  3,,.. 

where  G\ ,,,,(- )  is  given  by  ( 1 3).  It  can  be  shown  from  (16)  and  (13)  that 

(17)  E\Nkil)\  =  X 

which  indicates,  as  one  would  have  intuitively  expected,  that  the  higher  the  order  of  the  spread¬ 
ing  hierarchy,  the  faster  the  spread  of  the  information. 

It  is  interesting  to  investigate  the  behavior  of  Xk(t)  and  Nk(i)  when  A  —  °o.  For  Xk(i) 
we  can  get  from  (14)  that 

(18)  G\ =  1  ^  P[XAt)=  01  =  I. 
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which  is  no!  surprising  since  if  everybody  spreads  ihere  is  no  one  lo  carry  out  the  action,  f  rom 
(lb)  we  can  obtain  an  integral  equation  for  the  m.g.f.  of  V.,(/),  the  solution  of  which  is, 

(19)  6\  ,,,<;)  =  t-  *'[1  -  r<!  -  c  *'))  V 
The  m.g.f  in  (19)  corresponds  lo  the  distribution 

AM.V„(r )  =  n\  =  e  A'(l  -  e  /i  =  (),  1.  2.  ... 

i.e.. 

(20)  V.  ( r )  -  (ieometric  («■  *  ) 
with 

t[,V„(f>  |  =  eA'  -  1. 

This  result  could  have  been  obtained  directly  from  (17). 


An  important  generalization  of  the  hierarchical  spreading  model  arises  when  the  spreading 
rate  of  the  initial  spreader,  which  could  be  a  source,  is  different  from  those  of  the  subsequent 
spreaders.  Repeating  the  arguments  in  the  above  model,  when  the  initial  spreader  circulates 
the  information  according  to  a  Poisson  process  with  rate  /u ,  yields  for  Xk(l)  and  -%(/)  (which 
correspond  to  X\U)  and  NkU),  respectively,  in  the  ordinary  case) 

(21 )  Gx  ,,,(-- )  =  exp  |-/a  i  +  m  0'v4,,  ,(r)</vj 

<221  f'\. ,.„(*)  =  exp[-Mi  +M-X,  GVl,(r)</v]  k  =  2.  3 . 

Differentiating  >21)  and  (22)  with  respect  to  r  and  selling  r  =  1.  yields 

I  [v*  <7)1  =  n  £<'  “  f  t:[XkU)\ 

and 

i:\.\tn\  =  f  =  fn\\\u)l 

A  (  |  /  •  A 

when  A  tends  to  A  ,  (t)  behaves  as  A (see  Equation  (18)).  For  . V , ( / )  we  have,  recal¬ 
ling  1 1 9). 

(7\  ,,,(;)  =  c  A,(l  -  r ( 1  -  e  A,)l  “  A. 


which  corresponds  to  the  distribution. 

/* ( .V .  ( t )  =  n)  —  [^/A  +/("  '  !]  c  A,<1  -  e  A')",  n  =  0.  I . 

That  is. 


(23) 


iV.  (/)  ~  Negative  Binomial 


M 

A 


c 


» i 


4.  FREE  SPREAD  OF  INFORMATION 

In  this  model  we  make  no  prior  designation  of  spreaders  and  assume  that  every  hearer 
may  pass  on  the  information  in  addition  to  performing  the  action.  At  first  glance,  it  may  look 
contradictory  that  a  person  can  do  both  simultaneously,  but  one  should  bear  in  mind  our  intro¬ 
ductory  remark  that  an  action  need  not  involve  physical  efforts.  In  fact,  an  action  could  even 
be  instantaneous  in  which  case  //(■)  is  the  c.d.f  of  the  time  until  the  action  is  taken. 
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As  usual  I  he  process  siaris  at  lime  0  with  an  initial  spreader  who  circulates  the  informa¬ 
tion  according  to  a  Poisson  process  with  parameter  A.  Any  hearer  of  the  information  initiates 
an  action,  whose  time  to  completion  is  distributed  according  to  a  c.d  I  //<•).  and,  simultane- 
ousl>.  goes  on  spreading  the  information  at  the  same  rate  (Poisson  with  parameter  A).  The 
number  of  hearers  up  to  time  i:  XU).  should  have  the  same  distribution  as  V,  (/)  in  the  previ¬ 
ous  model,  so  that 

(24)  XU)  ~  Geometric  ( c  A ') 

This  result  is  also  obtainable  by  the  following  argument  The  time  at  which  the  /i-th  person 
received  the  message  T„.  can  be  expressed  as  the  sum  of  the  successive  "mterhearing"  periods 
of  the  first  n  hearers.  It  can  now  be  observed  that  these  periods  correspond,  in  reverse  order, 
to  the  "interfailure"  periods  of  a  system  which  is  composed  of  n  units  in  parallel  each  having  an 
exponential  lifetime  distribution  7,,  is  therefore  distributed  as  the  lifetime  of  this  system,  i.e  . 

Pi  /,.  ^  r)  =  (1  -  c  *'r.  i  >  0. 

which,  recalling  the  relation  PIT,,  ^  r)  -  PlXU)  >  n  ) .  yields  (24) 


(25) 


The  process  TV < / )  possesses  the  OS  property  |4|  with  a  kernel  c.d.f 
<-A"-  1 


Flu  I  = 


0  s?  u  <  i. 


Hence,  the  distribution  of  A’(r)  belongs,  for  any  t  >  0.  to  the  collection  —  (f\<, ,,  r  >  0) 
which  coincides  with  the  Geometric  family  of  distributions.  Therefore,  using  (5).  we  have 


(26) 

with 

since  here 


V <  i )  —  Geometric  ||l  +  eA'J|  A  c  A  “H  ( u  ) r/i/J  J 
m<r)l  =  <-A'  f  A  (*  A'77(// )du 

J  D 


H  =  c 


and.  by  (6). 

(27)  y  =  p  =  (1  -  c  A'>  1  J  Ac  A"/7 ( u  )du. 

Let  us  now  generalize  the  model  by  making  the  response  of  the  hearers  to  both  spreading  and 
acting  probabilistic  More  precisely,  we  assume  that  every  hearer  is  either  interested  or  unin¬ 
terested.  with  probabilities  fi  and  I  —  /3  respectively,  where  uninterested  hearers  neither  spread 
nor  act  while  those  interested  do  spread  hut  still  may  not  perform  the  action  with  probability  1 

—  a  . 


Letting  Sit)  be  the  number  of  interested  hearers  up  to  time  i.  it  can  be  verified  that  Sit) 
is  the  same  type  of  birth  process  as  Nil),  only  with  A/3  instead  of  A  Thus, 

(28)  Sit)  ~  Geometric  ie  Art'). 

Using  the  OS  properly  of  Sit)  and  the  BC  property  of  the  Binomial  family  of  distributions, 
with  transformation  function  tllH.y)  =  Hy.  it  can  be  verified  that 

(29)  XU )  |Si,, ~  Binomial  in, up). 
where 

/?=<!-('  *"')  1  f  A/3c  *''"//<  h  )du. 

J  i) 
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Applying  Proposition  4  and  using  (3)  with  »  —  (‘  *'*'  and  y  =  ap  wc  obtain 
(30)  Xit)  -  Geometric  ([l  +  «<•*'" M,,7 /(«)</u]  ) 


with 

£[.Y</>]  =  <*<■*'"  f  Kfic  Kl,"Hiu)du. 

J  0 


Like  in  the  hierarchical  spreading  model  we  can  now  generalize  this  model  by  allowing  the 
rate  of  the  initial  spreader  (which  could  be  a  source)  to  be  different  from  those  of  the  other 
spreaders.  Thus,  if  the  initial  spreader  circulates  the  information  according  to  a  Poisson  process 
with  parameter^,  (he  distribution  of  Nit)  should  be  identical  to  that  of  N  .it)  in  the  hierarchi¬ 
cal  spreading  model  (Equation  (23)),  i.e.. 


. V < z )  —  Negative  Binomial 


A 


c 


K  l\ 


It  can  be  shown  that  the  process  Nil)  possesses  the  OS  property  with  the  kernel  probability  dis¬ 
tribution  function  in  (25).  Furthermore,  the  m.g.f.  of  a  Negative  Binomial  distribution  with 
parameters  ix.tt )  is  the  m.g.f.  of  a  Geometric  distribution  with  parameter  h.  taken  to  the  power 
v  (.v  >  0).  Hence,  using  the  corollary  of  Proposition  1',  we  can  conclude  that  the  Negative 
Binomial  family  of  distributions  with  parameter  tt  €  [0,1]  is  BC,  for  any  a  >  0  with  it  given  by 
(3)  Therefore,  by  Proposition  3, 


Xit)  —  Negative  Binomial 


tL 

A 


"H(u )du 


i 


with 

£[*</)]  =  —  cK'  f  Ac  K,‘Hiu)du. 
A  J» 


5.  SPREAD  BY  A  SOURCE 

In  this  model,  we  have  a  source  (some  media)  which,  form  time  0  on,  transmits  a  piece 
of  information  to  a  population  of  size  N.  Any  member  of  the  population  may  hear  the  infor¬ 
mation  in  any  interval  it.t  +  A t).  independently  of  other  members,  with  probability 
AAr  +  0( A z ) ,  at  which  moment  he  initiates  an  action  whose  time  to  completion  has  a  c.d.f. 
Hi  ).  The  distribution  of  the  number  of  hearers  up  to  time  r.  Nit),  is 

Nit)  ~  Binomial  iN.  1  -  e  *'), 

since  the  c.d.f.  of  the  time  until  any  one  of  them  will  hear  the  information  is  given  by 
Liu)  =  I  -  c  A".  tt  ^  0. 

Moreover,  the  stochastic  process  Nit)  possesses  the  OS  property  with  a  kernel  c.d.f. 

I  -  e  * " 

F,iu)  -  - - -  .  0  u 

I  —  e 

Applying  now  Proposition  3  and  using  (2)  we  obtain 

Xit)  —  Binomial  |aG'  a'  A  <'*"//(  u  )</«j 

with 

F\Xii) I  »  Nc  f  A  (’*"// iu)du. 
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since  here  »  =  1  -  <■  A’and,  by  (6), 

(311  y  =  />  =  (<•*’-  I  )  1  A  <’*"// (u  )du. 

L  et  us  now  relax  the  assumption  that  all  the  population  is  exposed  to  the  source  and 
introduce  a  probability  fi  that  a  member  of  the  population  will  hear  the  information  at  all  We 
further  make  the  response  of  the  hearers  to  the  stimulus  probabilistic  and  let  «  be  the  probabil¬ 
ity  that  a  hearer  does  initiate  an  action. 

Denote  by  Z  the  number  of  people  who  are  exposed  to  the  source.  Then. 

Z  —  Binomial  iN.fi). 

Given  Z  =  m  the  conditional  distribution  of  Nil).  the  number  of  hearers  (out  of  the  in 
exposed)  up  to  time  i ,  is 

,V(r)|/,„,  ~  Binomial  (in.  I  -  e  *'). 

Using  the  OS  property  of  Nit)  we  can  show,  (like  in  the  previous  model  — see  Equation  (29)), 
that 

X(l)  \u,  —  Binomial  (n.up),  0  ^  n  ^  in  ^  N 

where  p  is  given  by  (31 ). 

Applying  Proposition  3  and  then  unconditioning  with  respect  to  Z  (which  amounts  to  one 
more  use  of  the  B(  property  of  the  Binomial  family  of  distribution)  we  finally  obtain 

.V(r)  —  Binomial  (,V.o/3  c  K'f)  A  eK"/f(u)du\ 

with 

E | .V < D I  =  ,V afi  e  A'  f  A  eK,‘H(u)du. 

0 

6.  MORE  GENERAL  SPREAD  PROCESSES 

Throughout  this  work  we  have  assumed  that  the  spread  rate  is  of  a  homogeneous  Poisson 
type.  In  this  section  we  shall  employ  our  procedure  to  solve  the  nonhomogeneous  case. 

Specifically,  assume  that  the  spread  rate  of  any  active  spreader  at  time  t  is  a  function  of  r. 
KU).  Beginning  with  the  single  spreader  model  we  have  the  well  known  result 

N(i)  —  Poisson  (A  (r)) 

where 

A  ( I )  =  f  k  (u)du. 

•rn 

The  nonhomogeneous  Poisson  process  also  possesses  the  OS  property  with  a  kernel  c.d.f. 

....  A  (u )  n  ^  . 

/,  ( u  )  =  77-  0  ^  ii  ^  r 
Ml) 

so  that  by  following  the  arguments  in  the  homogeneous  case  we  can  show  that 
XU)  ~  Poisson  (j^  Hi  -  trlr/t/j 
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t  M  < 1 1 1  -  j"  a  (u ) // (r  -  u ) du 
and 

G\.  I  ( r  )  =  exp  j—  ( 1  -  ;  f  A  (m  )//</  -  II  >  f/ll]  . 

Continuing  to  a  hierarchical  spread  structure  of  order  2  we  have 


v-  ,,,,  \  "  ( /  >  /"Alt).,  ,1,1 

=  Z‘-  -^r 

=  exp  |-  \(/)  +  J  A  (.,  )  G'p,  ,,(r)</v] 


where  l  f.t.ri  is  the  number  of  completed  actions  by  time  /  generated  by  a  single  spreader  who 
operates  in  the  time  interval  [c.rl  The  m  g  f.  of  this  r.v.  is  given  by 


ID 


=  exp  -  ( I  -  :  )  J*  Kiu)  ft  if  -  u  )  dll  . 


In  a  similar  way  we  can  obtain  results  for  higher  orders  of  spreading  structures. 

Proceeding  to  the  free  spread  model,  the  solution  of  the  Kolmogorov  backward  equations 
for  the  probabilities  /MAM/ 1  =«)/!  =  ().  I,  2 . yields 

.V  ( / 1  —  Geometric  (<• 

It  can  also  be  directly  verified  that  AM i)  possesses  the  OS  property  with  a  kernel  c.d.f. 

f  ,(u  I  =  ^-r— - L,  0  u  <  i. 

rv,n  -  1 

Repeating  the  arguments  in  the  homogeneous  case  we  finally  obtain  here 
AM/)  ~  Geometric  |  jl  +  f  k  in  )c""' H  (t  —  uldtvj  J 


/;  I  V(/l)  =  f  A  ( 1/  ) «'  '  1 "  1  //  (  /  -  //) 

•'ll 


Turning  to  the  last  model,  in  which  the  information  is  spread  by  a  source,  we  now  assume 
that  each  member  of  the  population  may  hear  the  information  in  the  interval  (r.t  +  A/)  with 
probability  A</)A/  +  nlA/l.  We  then  have 

AM/ 1  ~  Binomial  CV.  I  -  <■ 

and  moreover,  the  process  AM i)  possesses  the  OS  property  with  a  kernel  c.d.f. 


f,Ui  I  - 


0  ^  u  ^  /. 


Applying  Proposition  3  yields  here 


AM i)  —  Binomial  |a, J^kt/Of  "“'HU  -  u)du] 
/:|A</)l=Arf  Mule  '*"’//(/  -  u)du. 
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As  a  matter  of  fact,  our  procedure  can  he  used  in  this  model  to  obtain  a  complete  general 
solution. 

Recalling  our  definition  of  /.  (  )  as  the  c.d.f  of  the  time,  since  the  beginning  of  the 
transmission  of  the  information,  until  a  member  of  the  population  hears  it.  vie  have 

V(/)  —  Binomial  (.V./  fri) 

It  is  not  difficult  to  observe  that  hy  us  very  nature,  the  process  .VI/),  possesses  the  OS 
property  with  a  kernel  c.d.f. 

/•,  (u  I  =  ,  0  ^  u  ^  i. 

Hi) 

l 'Sing  our  procedure,  we  finally  obtain 

•VO)  —  Binomial  j  V.  §  Hu  -  »)<//.  (u)| 

with 

/:  I.VIf  •  I  =  V  J"  //(/  ii  IdUli  l 

Note  that  in  this  case  the  distribution  ol  \(i)  could  also  have  been  obtained  cl  1 1  c  c  1 1 .  in 
defining  a  "success."  for  nn>  member  of  the  population,  as  the  event  ol  having  accomplished 
the  action  by  time  t 
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\BSIR\(  I 

In  this  wnik  m.mm.tl  Nash  subscls  arc  studied  m  order  l<>  show  lli.il  the  set 
of  cquilihtium  points  ot  a  hinialrix  game  is  the  finite  union  of  all  sikh  subsets 
In  .uKInion  the  extieme  points  ol  maximal  Nash  subsets  are  chaiactcu/ed  in 
terms  of  square  siibm.iiriv.es  o|  the  payoff  matrices  and  dimension  relations  are 
derived 


I.  INTROIH  (  I  ION 

A  binuiti i.\  inline  is  defined  hy  a  pair  (  I ,li)  of  real  mivm- matrices.  A  strategy  for  player  I 
(II)  is  an  element  of  V'f.S"),  where  S"  .«  !/><•:  R"*;  /»  ^  0.  £  /»,  =  1|  Corresponding  to  the 

i 

strategy  pair  (/).</ )  C  .S'"  x  .S'1  the  payoffs  are  /'  It/'  and  pHq' .  respectively. 

A  pair  (/>.</)  i  S'"  x  S"  is  called  an  equilibrium  point  of  the  mv/i-bi matrix  game  (  I ,li)  if 
p  \q:  =  max  /»  ((/'  and  p/iq1  =  max  pHq  The  set  of  all  equilibrium  points  of  I  A, Hi.  which  is 

nonempty  by  a  theorem  ol  J  I  Nash  |4.|()|.  vs  ill  be  denoted  by  I  (  I  H). 

NOT. MU  )N:  for  a  natural  number  m.  let  IN,,,  :=  1 1 . ml.  Ihe  elements  of  the 

basis  of  unit  vectors  of  IK"1  are  denoted  by  n . <•„,.  for  a  finite  set  V,  |.V I  is  the  number  of 

elements  ol  S  Ihe  convex  hull  ol  a  set  V  C  Bi™  is  denoted  by  convt.V).  If  C  C  1R'"  is  a  con¬ 
vex  set.  then  we  write  extU  >.  dimU  )  and  relint(C)  for  the  set  of  extreme  points  of  (  .  the 
dimension  of  < t he  alline  hull  of)  (  and  the  relative  interior  of  (  .  respectively 

Let  (  I  ,H)  be  an  mivm-  hi  matrix  game  and  let  </»..; )  €  S’"  x  .S'".  It  is  well-known  <(T  |7|. 
theorem  4)  that  ( p.q)  £  U  i.H)  iffC(/>)  c  \/t  l.q)  and  (  it/)  c  \Hp.H).  where  C(/>)  (the 
earner  of  />)  :=  !/  €  IN,,,;  p,  >  <)} .  (  (</)  :=  \j  €  IN,,;  </,  >  0),  V/(  l.q)  ;  = 
1/  C  IN,,,,  e  it/'  -  max  e,  (</')  and  \Hp;li)  :=  1/  €  IN,,.  pHe]  =  max  plie[  I. 

.  i\.  .  ■  IN. 

Ihe  organi/ation  of  the  paper  is  as  follows.  In  Section  2  we  show  that  the  set  of  equili¬ 
brium  points  of  a  hinialrix  game  is  the  union  of  convex  poly  topes  The  equilibrium  point  set 
can  therefore  be  constructed  if  we  know  the  extreme  points  of  these  convex  poly  topes  These 
so-called  extreme  equilibrium  points  are  studied  in  the  third  section  As  a  by-product  we  find 
that  the  set  of  equilibria  is  in  fact  a  finite  union  Finally,  dimension  relations  are  given  for  the 
convex  polytopes  mentioned  before. 
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2.  I  UK  SIR l  (11  RK  OK  MAXIMAL  NASH  SI  BSKTS 

1)1  I  INI  I  IONS  I  ci  (  -t.fi)  he  ,i  bimairix  game  and  lei  V  C  UA.B)  We  call  ;wo  equili- 
brium  points  (/».</ ).  ip'.q')  f  S  S-mlert  lianpi able  if  ip.q)  (  S  and  ip’.q)  (  V.  We  call  two 
equilibrium  points  micnhanpeahle  if  they  are  /.  <  L/D-interchangeable.  We  call  .V  a  .Wv/t  subset 
for  ihe  game  (  l./O  if  every  pair  of  equilibrium  points  in  Vis  .S-interchangeable  A  Nash  subset 
S  is  called  a  minimal  Sash  subset  lor  ihe  game  i. -t.fi)  if  there  exists  no  Nash  subse!  /  C 
/  (  I  /O  such  that  S  is  properly  contained  in  / 

Ihe  term  maximal  Nash  subset  was  lirsi  introduced  by  (i.  A  lleuer  and  C.  B.  Millham  in 
|4|  J  I  Nash,  who  already  considered  such  sets  m  lhs|  |||)].  called  them  sub-solutions. 

I  hese  authors  showed  that  a  maximal  Nash  subset  for  an  mxn- hi  matrix  game  is  a  closed  and 
convex  subset  ot  V  x  y  I  he  following  theorem  implies  that  a  maximal  Nash  subset  is  in 
fact  the  (  artesian  product  of  two  convex  polytopes 

I  III  OKI  \1  1  l  et  (  l./O  he  an  w.v/t-himatrix  game  and  let  S  be  a  maximal  Nash  subset 
lor  the  game  t  -t.fi)  Suppose  that  (/>.</ 1  f  relint  LS).  I  hen  S  =  A  <y  I  x  Up),  where  A  (y) 
!/.  *  V".  t  />.</ 1  f  /  ■/<  l./O!  and  Up)  :=  |y  c  V":  ip.q)  t  /.l.l./O]  are  convex  polytopes. 

PROOI-  l  et  7T,(.St  =  (/»  €  S'":  there  exists  a  </  €  .V”  with  </>.y)  €  .V|  and  rr  t.S)  :  = 
{</  *.  V  .  there  exists  a  p  (  S'"  with  ip.q)  (  .V).  Since  it  is  clear  that  S  =  W|(.V)  x  rr.I.Y).  the 
theorem  is  proved  il  we  can  show  that  7r,(.V)  =  A<y)  and  7r  s  ( ,S  )  =  lip)  The  inclusions 
7t  (  St  C  A  (y  •  and  7t  ■  (  S  i  c  l  ip)  are  immediate.  Suppose  that  p  (  Alt/)  and  q  €  w  >(.V(. 

Since  q  r  reltnt  it  i  S ).  I  heorem  6  4  of  1 1 1 1  implies  that  there  is  a  q'  €  n  AS )  and  a  A  €  (0,1) 

such  ihat  y  *  A  y  +  ( l  V  >y’  I  torn  y  V  it  AS)  C  IAp)  it  follows  that  p  (  Kiq).  Also,  p  6 
Alt/  ).  Hence.  Alt/)  n  Aly  l  ^  <6  and  Lemma  3  5  of  [4]  implies  that  Alyl  =  Kiq)  ft 

A  < y  l  So  />t  A  ( y  I  and  !/> !  *  it  .(.SI  C  UA.f i).  If  />  ?  n  , ( .S' ) .  then  conv(»r  ,(.V)  U  lp|)x 

it  d. S'l  is  a  Nash  subset  properly  containing  the  maximal  Nash  subset  S.  This  leads  to  a  con¬ 
tradiction  So  p  •  n  (.SI  and  we  have  proved  (hat  Kiq)  C  7t  ,  ( .S  •  In  a  similar  manner,  one 
can  show  that  Up)  <■.  jt.LS'I  finally.  it  is  well-known  that  A(y)  and  l.ip)  are  convex 
polylopes  I  ; 

Ihe  following  l  emma  can  be  proved  in  the  same  wav  as  Theorem  I  in  12]. 

LI.MMA  1  I  el  i  A.H)  he  a  bimatrix  game  II  (  is  a  convex  subset  ol  UA.B).  then 

every  pair  of  equilibrium  points  in  (  is  interchangeable 

It  is  well-known  that  a  maximal  Nash  subset  is  a  convex  set  not  properly  contained  in  any 
other  convex  subset  of  the  set  of  equilibrium  points  Ihts  property  is  characteristic  lor  maxi¬ 
mal  Nash  subsets  as  we  will  prove  now 

THLORLM  2  Let  (. -t.fi)  he  a  bimairix  game  and  let  (  be  a  convex  subset  of  UA.B) 

not  properly  contained  in  any  other  convex  subset  ol  U  t.B)  Then  (  is  a  maximal  Nash  sub¬ 

set  tor  the  game  <  4.  B  I 

PROOI  (a)  first  we  prove  that  (  =  \ip.q)  €  UA.B).  there  exists  an  (  v.  i  )  €  UA.B) 

with  (v.y).  ip.\ )  (  (  I  is  a  convex  set  If  ip.q).  ip.q )  €  (,  then  there  exist  (.v.y),  fv.y)  € 

UA.B)  such  that  (v.y).  ip.y)  £  (  and  (v.y),  </>.()  6  (  But  then  (A.v  +  (l-A).v,  Ay  + 

( I  -  A  )y ).  (A  p  +  (1-A)/?.  Ay-K  (I -Alyl  €  ('.for  all  A  (  (0,1)  In  view  of  the  foregoing 

Lemma,  we  may  conclude  that  (A.v  +  (l-A).v.Ai  +  (I— A)y)  £  /-.’( A.B)  and  (A  p  +  (I  —  A  )p. 
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A  </  +  <I~A)</)  £  IAA.B).  Tor  all  A  £  (0.1)  Consequently,  A  (p.q )  +  (I  -K)(p.q)  f  C.  lor 
.ill  A  t  (0.1)  (b)  Also,  C  is  a  Nash  subset.  U  (p.q).  (p.q)  £  C,  ihen  there  exist  (v.i).  (v.i) 
t  l-AA.fi)  as  in  (a)  Note  that  (.v,r),  (.v.i')  £  (  So  Lemma  I  implies  (with  C  in  Ihe  role  of 
(')  that  (v.v  )  £  IAA.B )  Similarly,  (p.q)  £  IA  A.B)  Since  (p.q).  (.v.i  I  £  IAA.B)  and  (/i.r). 
(v.i/)  £  C.  it  follows  that  (p.q)  £  C  Similarly,  (p.q)  €  C,  and  (p.q)  and  (p.q)  are  (- 
interchangeable  te)  Because  C  is  convex  and  CC  C.  it  follows  that  C  —  O  So.  in  view  ol 
(b).  C  is  a  Nash  subset  It  is  obvious  that,  m  addition.  C  is  a  maximal  Nash  subset,  i  i 

COROl  l  ARV  I  (Cl  1 2 1 .  Theorem  I):  If  (  A.B)  is  a  hi  ni.it  ri  x  game,  then  IA  A.B  I  is  con¬ 
vex  if  and  only  if  IA  A.B)  is  a  Nash  subset 

RLMARK  I  Let  (A.B)  be  a  himatrix  game  and  let  I/m/)  f.  IAA.B)  Since  \(p.q)\  is  a 
Nash  subset  for  the  game  < A.B).  we  can.  applying  Zorn's  lemma,  find  a  maximal  Nash  subsel 
containing  (p.q  >.  Consequently,  every  equilibrium  point  of  the  game  (  A.B)  is  contained  in  a 
maximal  Nash  subset  and  IAA.B)  is  the  union  of  such  subsets. 

3.  FXTRFMF  POINTS  OF  MAXIMAL  NASH  SI  BSFTS 

Lor  a  matrix  game  L.  S  Shapley  and  R.  N  Snow  (121  characterized  all  pairs  of  extreme 
optimal  strategies  of  the  players  We  want  to  describe  lor  the  case  of  bimatrix  games,  the 
extreme  points  of  the  maximal  Nash  subsets  Our  approach  incorporates  the  work  of  II  W. 
Kuhn  (> I  and  O.  L  Mangasarian  [6| 

l)LI  INITION:  An  equilibrium  point  of  a  bimatrix  game  (  A.B)  is  called  an  extreme  equili¬ 
brium  point  if  it  is  an  extreme  point  of  some  maximal  Nash  subsel  for  the  game  (  A.B). 

In  |6|.  OL  Mangasarian  introduced,  for  an  w.vn-bimatrix  game  (  A.B).  the  convex 
polyhedral  sets  PH  :=  ( (p.ti)  £  -V'"  x  IR.  pBe\  $  0  for  all  /  £  IN,,)  and  Q,  :  —  {(«/,<» )  £  S"  x 

IR:  e,Aq  <  or  for  all  /  €  IN,,,)  These  sets  play  also  a  role  in  the  proof  of  the  following 

THUORLM  3:  The  set  of  equilibrium  points  of  a  bimatrix  game  is  a  (not  necessarily  dis¬ 
junct)  union  of  a  finite  number  of  maximal  Nash  subsets. 

PROOF:  Let  .S'  be  a  maximal  Nash  subset  lor  the  game  (A.B)  and  suppose  that  (p.q)  £ 
exll.S)  and  that  (p.q)  £  relinl(.V).  Then,  by  Theorem  I.  we  have  p  £  ext (K(q))  and  q  € 

ext  (Up))  The  reader  can  easily  prove  that  this  implies  that  ( p.pBq ')  £  ext  (P/()  and  that 

(q.pAq)  €  ext((3,)  (C'f.  1 5 1 .  Lemma  1)  Hence,  if  (p.q)  is  an  extreme  equilibrium  point  of 
the  game  (A.B).  then  (p.pBq1. q.pAq')  £  ext  </’„)  x  ext((2|)  Since  ext  (P„)  and  ext  ( CZ  i )  are 
finite  sets,  the  number  of  extreme  equilibrium  points  of  the  game  (  A.B)  is  also  finite.  Hence, 
the  number  of  maximal  Nash  subsets  is  finite  I  i 

RLMARK  2:  In  (6|.  O.  L.  Mangasarian  called  an  element  (p.q. ut.fi)  £  .S'”'  x  .V"  x  |R  x  IR 
an  extreme  equilibrium  point  ol  the  rnv//-bimairix  game  (  A.B).  if  (p.fi)  €  ext(/V-  (</,<»)  £  ext 
<(2|)  and  /H  I  4  B)q'  =  u  +  fi  It  is  easy  to  show  that  a  point  (p.q. tt.fi)  is  an  extreme  equili¬ 
brium  point  in  ihe  sense  off)  L.  Mangasarian  if  and  only  if  (p.q)  is  tin  extreme  equilibrium 
point  in  the  sense  of  definition  I  and  if  furthermore  n  =  pAq'  and  /3  =  pBq'  Therefore. 
Theorem  3  implies  the  Lemma  on  page  77d  of  |b| 
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K1  MARK  .V  I  lie  extension  of  fheorem  3  to  the  ease  of  more  than  two  players  does  not 
necessarily  hold  On  page  3  of  12),  II  II  (  Inn,  1  Parihasaraihy  and  I  I  S  Raghavan  give  an 
example  ol  noneooperalne  3-person  game,  where  all  the  players  have  the  set  S'  as  strategy 
spaee  and  where  the  set  ol  equilibrium  points  is  equal  to  the  (convex)  set  (t(A.l-A), 
t  A  .  1  A  i,  (A  .  I  A  )  I  t  S’  x  .S  x  S  .  A  (  (o.lll  I  his  set  ol  equilibrium  points  is  the  union 
ot  an  uncountable  number  of  maximal  Nash  subsets. 


for  a  prool  of  the  following  theorem,  see  Lemma  2  of  II  W  Kuhn  1 5 1 . 


fill  OKI  M  4  let  (•(./#)  be  an  w.vn-bimatrix  game.  If  Ip.q)  is  an  extreme  equilibrium 
point  of  the  game  (  I.B)  and  y  is  the  number  of  elements  of  the  carrier  of  q,  then  there  exists 
a  y  \ y-submatrix  A  of  -I  such  that  (renumber,  if  necessary,  the  rows  and  columns  of  A  in  such 
a  way  that  A  is  in  the  upper  left  corner  of  A 1 


1 1  i 


the  (y  *  I )  x  (y  t  I ) -matrix  A 


A 

I 


is  nonsingular. 


( 2 1  q  ~  (del i  A  M  1  ]T  A  if  i  i  Uq  )  and 

I A  is  the  cofacior  of  the  element  A  ,  I 
( 3 1  />  lq  =  da  (  A  da  (A) 


An  analogous  statement  can  be  formulated  with  respect  to  the  connection  of  the  vector  p  and 
the  number  pBq  with  a  certain  square  submatrix  of  B. 

Rl  MARK  4  Let  IA.B)  be  a  bimatrix  game  Without  loss  of  generality  we  may  suppose 
that  A  >  0  and  B  <  0.  Let  .S'  be  a  maximal  Nash  subset  for  the  game  IA.B).  Suppose  that 
Ip.q)  €  relini(.S')  and  that  Up)  —  (</).  Note  that  the  proof  of  Theorem  4  is  based  on  the  fact 
that  the  rank  of  the  matrix  -I  (.S')  :=  If/,,1,.  Ul  ,  „ ,  ,, ,  ,,, ,  equals  I  Uq)  I.  Using  the  fact  that  A 
>  0.  Theorem  4  (3)  implies  that  dim  Up)  =  (('(</)(  -  rank  A  IS).  We  shall  see  in  Theorem 
5  that  a  similar  statement  holds  for  sets  Up)  with  more  than  one  element.  If  Alq)  —  (p), 
then  dim  Air/)  =  |(  Ip )  |  -  rank  BIS),  where  BIS)  :=  lb,,),,  , 

4.  A  DIMENSION  RELATION  FOR  MAXIMAL  NASH  SUBSETS 

The  purpose  of  this  section  is  to  extend  the  dimension  relations  as  given  by  C.  B  Millham 
in  (81  The  relations  derived  below  include,  in  contrast  to  the  results  in  Millham's  paper,  those 
for  the  ^ero-sum  case  (Cf.  (I).  |3|). 

LLMMA  2  Let  IA.B)  be  a  bimatrix  game  and  let  S  be  a  maximal  Nash  subset  for  IA,B) 
Suppose  that  Ip.q)  (  relint  <  .S' ) .  Then,  for  all  Ip.q)  £  S,  Up)  C  Up),  Uq)  C  Uq  >, 
MlA.q)  3  MlA.q)  and  MIp.B)  D  M!p\B). 

PROOL:  Suppose  that  p  £  Klq),  p  ^  p.  Because  p  €  rclinl  Klq),  there  exist  a  p  £ 
Airland  a  A  £  (0,1 )  such  that  p  =  A/i  +  ( 1  —  A  )/5.  This  implies  that  Up )  C  Up).  Now, 
for  /  £  MIp.B), 

pBq'=  pBc  =  kpBc’  +  (I  —  A  )pBi''t  <  A  pBq'  4-  (1  -A  )pBq'=  pBq'. 

This  is  possible  only  if  pBv't  —  pBq'  So  /  £  MIp.B)  and  we  have  proved  that  MIp.B)  C 
MIp.B)  The  other  assertions  arc  proved  in  a  similar  way.  I  I 
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DELINITION:  Let  ( A.B )  be  a  bimalrix  game  and  Icl  S  be  a  maximal  Nash  subset  lor  the 
game  A.B).  In  view  of  Lemma  2,  the  matrices 

A  (.S')  :=  [c/,,1,.  i  and  B ( S )  :=  |A„I,.  , 

do  not  depend  on  the  choice  of  the  point  (p.q)  €  relint(.V).  We  call  .4  (.S')  and  BiS)  the  S- 
submatrices  of  A  and  B.  respectively. 

THL.ORLM  5:  Let  (.-LB)  he  an  m.vn-bimatrix  game  with  A  >  0  and  B  <  0  Let  .She  a 
maximal  Nash  subset  for  the  game  (A.B)  If  (p.q)  €  relint(.V), 

then  ( I  >  dim  /.  (/))  =  |('(</ )  I  -  rank  A  (S ) 

and  (2)  dim  K(q)  =  |C(/>)[  -  rank  B(S). 

PROOF:  We  only  prove  (1).  If  Lip)  has  only  one  element,  we  are  finished  (Remark  4). 
Suppose  now  that  Up)  contains  more  than  one  clement.  There  is  no  loss  of  generality  in  sup¬ 
posing  that  Ciq)  —  {l . >  I ,  where  y  —  |C(y)|.  Let  d  y  —  rank  .4t.S).  Choose  a 

basis  v(l) . {•((/)  of  Ker  A  (SI  :=  (v  6  Rr;  A(S)x'  —  0)  in  such  a  way  that,  for  each  k  € 

IN,/,  q  +  v  ( A  )  >  0,  where  q  :=  (q , . </r),  and  pAq'  -  e,Aq'  >  e,Ax(k)'  for  each  /  <? 

AKA  ,q  ).  where  x  (k)  :  =  Lv (A  >, 0 . 0)  €  DC.  We  normalize  the  vectors  q  +  x(k)  in  such 

a  way  that  the  normalized  result  v(k)  is  an  element  of  S".  We  leave  it  to  the  reader  to  show 

that  the  vectors  q.  y(  I ) . y(d)  are  linearly  independent  vectors  in  Lip).  Hence,  dim  Up) 

£  it.  Suppose  now  that  there  exists  a  vector  .i(c/  +  l)  e  relint  Lip)  such  that  the  vectors 

'•!)-</ . >  (c/  +  l)  -  q  are  linearly  independent.  Then,  in  view  of  Lemma  2,  Ciyiil  +  1 ) I 

=  f  ill )  and  M  i  A  (r/  +  l))  =  M  (A  ,q) .  So  if  f  ( k  )  :  =  (,c  (A  )  | . y  ( A  )y ),  for  each  A  6 

IN,,.,,  then  A(S)[i(k)lpAy(kV  -  q/p.Aq']  =  0,  for  each  A  6  IN,/.,.  This  is  impossible  since 
dim  Ker  A  (.S')  =  d.  So  dim  L  ip)  =  d.  |  | 

It  is  easy  to  prove  that  Theorem  I  in  (8|  is  implied  by  Theorem  5. 

Lor  a  matrix  game  A,  the  only  maximal  Nash  subset  is  t1"  set  S  of  all  pairs  of  optimal 
strategies  for  both  players.  In  this  case,  the  S'-submatrix  of  A  equals  the  essential  submalrix  of 
A  (Cf.  [21.  page  44)  and  the  dimension  relation  foi  matrix  games  follows  from  Theorem  5. 
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IM ROOK TION 

In  a  beautiful  paper,  I  I  Vilkas  pave  a  eharaeteri/ation  of  the  value-function,  defined  on 
the  class  of  all  finite  matrix  games  1 2 1  In  III,  pp.  (>()-()>  this  result  was  extended  in  the  class  of 
till  finite  and  semi-infinite  matrix  games 

The  purpose  of  this  paper  is  to  deduce  characterizing  properties  for  the  value-function  on 
the  set  of  all  determined  two-person  games  The  organi/utinn  of  the  paper  is  as  follows  the 
necessary  notation  and  definitions  tire  given  in  sections  I  and  2.  in  section  3.  properties  for  the 
value-function  are  presented,  which  are  shown  in  section  4  to  be  characteristic  of  this  function. 

I.  A  Izero-sum)  uvo-persuii  name  is  an  ordered  triple  <  \.).k  >,  in  which  \  and  )  are 
nonempty  sets  (called  the  pure  snciieny  spares  of  player  I  and  player  If.  respectively!  and 

k A  x  >  —  |R  is  a  real-valued  function  on  the  Cartesian  product  of  V  and  I  (called  the  pa\ 

off  luneinm  of  play  er  I  > . 

2  Let  <  \,).k>  be  a  two-person  game  l  or  each  \  •  V  <  i  f  >  I  let  us  denote  the 
probability  measure  on  \  (  >  I  with  mass  I  m  \  <i)  bv  c,  (c  t  I  el  l\  be  the  set  of  all  convex 
combinations  of  elements  of  !<■..  v  '■  A'!,  likewise  let  /’,  be  the  convex  hull  of  !<■  .  i  -  )  ) 

Then  the  two-person  game  </V  /  with 

/  *,  I  yi  /  l  :=  ^ ^  K  ( x.  i  I  tip  ( v  )  tie  <  i  >  lor  each  (/u  ,  r  I  ■  l\  y  I1. 

is  called  the  <  -mixed  extension  u/  the  an  me  <  V.  ).k>  I  he  lower  value  sup  inf  /.^(gi.i'1 

of  the  game  <  l\.l\  ,f.K  >  ts  denoted  bv  y<  A.  ).k)  and  the  upper  value 

inf  ,  ,,  sup,,  /.  /.*,  *  is  denoted  by  v(A  ,).k)  Note  that 

-co  ^  y (  V.  >.  A  )  $  v  ( A,  }.AI  $  >». 

If  y(,V,  Y.k)  =  v  ( A .  > .  A.  >  for  a  game  <\.).k>.  then  we  say  that  the  game  is  a  determined 
name.  In  that  case,  the  common  value  is  denoted  bv  v(  V,  ).k  >  and  called  the  value  of  (the  c- 
mixed  extension  of)  <  A. ) .k  >  The  familv  <>(  dete  mined  games  is  denoted  hy  /) 
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3  In  ilm  section  »o  want  to  look  at  some  distinguished  properties  ol  the  value-function 
v  /)  —  |  no,  xi |  for  this  purpose  we  need  some  definitions 

1)1  (  INI  1  ION  I  The  irtins/w \e  of  a  two-person  game  <  V.  LA  >  is  the  two-person  game 
•c  L  Y.  A  >  w  here 

A  (y.  \t  -  A.  <  \ .  i  )  for  each  (i.\)  1  >  *  V 

1)1  I  INI  I  ION  2  Let  <  V.  LA  >  be  a  two-person  game  and  let  S  he  a  nonempty  subset  of 
V  Then  we  say  that  S  is  snlliucni  lor  plover  I  in  ihr  wmic  <  A.  LA>  if  for  each  \  <  X  S 
there  exists  apt  l\  such  that 

/:\  (p  .c  )  i  A  <  a .  t  )  for  each  i  (  L 

DIMM  I  ION  V  I  et  <  V.  LA  >  he  a  two-person  game  and  let  7  be  a  nonempty  subset  of 
L  We  say  that  7"  is  su/liuem  lor  plover  II  in  ilic  name  <  A.  1 .  A  >  if  7  is  sufficient  for  player  I  m 
the  game  <  L  V.  A  > 

THEOREM  I 

IP.  1 1  ("Objectivity" I  Let  <  Y.  LA  >  be  a  two-person  game  and  suppose  that 
.V  =  |tfj.  >  =  { /> |.  Then  <  .V.  LA  6  /.)  and  v( A,  LA  >  =  A  («./>). 

I  P.2)  ["Monotonicity"  I  Let  <  A.  LA>  €  I)  and  <  A.  >./.  >  €  I)  and  suppose  that 

l.  >  A  ti  e.  7-C.v.v)  ^  A  Lv.t  )  for  each  Lv.t)  €  Yx  ) ),  Then 
vl.Y. ) ,L)  >  vLY.LAL 

IP. 3)  |"Symmetry“l  Let  <  .Y,  T.A  >  t  />.  Then  <L.Y.  ~A  >  £  /)  and  v(  T..Y. -AT  - 
- v(  Y.  LA  I 

<P.4>  |"SuHiciency"|  Let  <  Y.  T.A  >  be  a  two-person  game,  and  </>  ^  .V  C  .Y  and  let 

A  :  .S'x  >  —  IK  he  the  restriction  of  A  to  .S'x  L  Suppose  that  .V  is  sufficient  for 

player  I  in  the  game  <  A,  LA>  Then  <.Y,  LA  >  (.  I)  itT  <  A,  ).A>  €  7)  and 

vl.Y.  T.A)  =  v  (  S.  L  A  >  if  <  V. L  A  >  t  /). 


PROOF:  IP  I)  and  (P.2)  are  obvious  (P.3)  follows  from  the  fact  that 

-  /:\  < /x  .  r  )  =  I:  ^  (r . p  )  lor  each  t/a  .  r )  €  7'y  x  /J, 

Now  let  us  prove  (P.4),  first  we  note  that  7\  can  be  seen  tin  an  obvious  manner)  as  a  subset 

of  l\.  and  that  /.j,  is  the  restriction  of  llK  to  /Ax/',. 

Take  o  €  l\ .  Then  there  exist  n  €  IN,  .v'..v* . v"  €  V  and  /»|./<: . p„  €  [(), oo )  such 

')  II 

that  £  Pi  =  I  and  o  =  £  /»,<\-  Since  V  is  suITtcieni  for  player  I  in  the  game  <  .V.  LA>  lor 

-  i  .  t 

each  i  €  1 1 . // ) .  there  exists  an  a,  L  l\  such  that 

l:K( ii  .«■, )  A(  \  .t  )  for  each  t  (  L 

III  \  ‘  S.  then  we  can  take  <t  =  c  |  l  et  o  :=  £  /><>  I  hen  o  f  /\  and 

i 

l.K  (« .e,  )  ■=  JT  />  /.a,  to  ,c  I  J  £/>AI\.iD  /-.jy  <««.r,  tfor  each  i  t  >. 

i  i 
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Bui  then. 

(il  ^  l^io.r)  lor  each  it  i  l\  ;i  ml  each  r  C  /’> . 

This  implies  that 

sup,,  i\  I-k  I /i.i’l  -  sup,.  /.  /.^td.i'l  for  each  r  r  /J, 

ami  thus 

( it )  v  (  V.  >.A>  -  vt.V.  > .  A  ) 

I  rum  til  we  may  also  conclude  that 

ml  (>i  Iti.r)  >  ini’  ;>1  l:\(< r.r)  lor  each  It  € 

ami  then 

lull  v(  .V.  T.A  )  =  v(  V,  T, A  ) . 

Now  (P.4)  follows  front  lii)  and  (iii).  !! 

4  I  he  following  theorem  shows  that  the  properties  (PIMP. 4)  characterize  the  value- 
function  V  :  /)  —  l-oo.oo  | . 

THLORLM  2  Let  1:1)  —  be  a  function  with  the  following  four  properties: 

(Q  1  >11  V  =  (</).  T  =  [h]  and  if  A  is  a  real-valued  function  on  A'x  ),  then 
/  <  V.  T.A  )  =  A  <«,/>). 

(Q  2)1  or  each  <  A.  T.A>  e  I).  <  V,  >./.>  €  /)  with  L^K  HX.Y.U  >  /  (.V,  >.A). 
(Q. 3)1  or  each  <  A.T.A>  £  I)  /(T..V.-A )  =  -/(.Y.T.A). 

(Q  411  or  each  <  V.  T.  A  >  €  /A  and  <.V.  T.A  >  €  A  where  V  C  A.  A  is  the  restriction 
of  A’  io  Vx  T  and  where  V  is  sufficient  for  player  I  in  the  game  <  .Y.  Y.K  > ,  we 
have  US.  Y.K  >  =  /  (  .V.  T.A). 

Then  /(.Y.T.A)  =  v(  A',  T.A)  for  each  <  A.  T.A  >  t  A 

PROOf  first  we  note  that  (Q.3I  and  (Q.4I  imply 

(Q.5)Tor  each  <X.Y.K>  €  /fund  <X.T.K">  £  A  where  V  C  T.  A"  is  the  restric¬ 
tion  of  A  to  A'x  /'and  where  A  is  sufficient  for  player  II  in  the  game  <  A.  T.A>. 
we  have  /  (  A,  7. A  )  =  /  (  A.  T.A'). 

Now  take  an  <  Y.T.A  >  £  I)  with  vt.V.  T.A)  t  t-oo.oo)  and  take  a  real  number  i  such  that 
vt.Y.  T.A)  >  I.  We  want  to  prove  that  /’(  V.  T.A)  >  i.  for  this  purpose  we  introduce  the  fol¬ 
lowing  live  two-person  games: 

(1)  <  .V  U  |«).  Y.l.  >  where  a  9  X  and  where  /.  <  \.  v )  :=  Atx.r)  tor  each  t.x.vl 
£  A'x  >' and  lAu.y  (  :=  i  for  each  v  f  T 

(2)  <  ,V  U  |«|,T..V/>  where  A/lv.r)  :==  minimum  | Atx.i  )./)  for  each  ( x.t  ) 
(  (A  U  |r/|)x  > 
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i.  ]/>',.  \  >  where  ft  '/  >  and  where  Nix. 
!,/;>*  )  ami  \  <  v.ft  1  /lor  each  \  •  \  1  / 

1  ) 

<  ,  1 

A/ 1  v.i  I  for  each 

(4) 

<  I  <j  ! , 

. )  v!  ft 

l.\  •  where  \  is  the  testiKlion  ol  V  to  It/ 
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1  * 
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">  where  \  is  the  restriction  ol  \  to  It/!  »  1 

h  \ 

Since  \IA.).A>  >  /.  there  exists  a //  ■  /’,  such  llwl 

/./  t/a  •  /  «,  l/a  c  I  ■  /  /  1 1/  i  i  I m  eav h  i  •  ) 

Hence.  Vis  sufficient  for  player  I  in  ihe  game  ■  I  .  It/!.)./  •  Ifi  t|’4)  and  (Q4>  we  may 
conclude  that 

(Q.6)  <  V  U  !«!.>./.>  C  /)  and  /' \  IJ  !</!.>./  >  /'\.).A  i 

li  follows  from  <Q  1  )  that 

|Q  7  I  /  I  |n|. |ft|.  \  i  /. 

In  the  game  <  \a\.  I  U  |ft|.  V  >  the  set  |/>!  is  sullicient  lor  placet  II  because 

/  s  ((.<■.  >  \  1 a. i  I  -  t  lor  each  i  •  ) 

I  hen  <  |t/  I .  )  U  |  A ! .  V  '  •  /)  in  view  of  <  P  3 1  and  1 1’  4  > .  mm  (Q  s  l  mi  (dies 

(Q  Si  /  (|t/|.  >  u  | /> | .  \  i  -  ;  (!</',.  [b\.  \  I 

In  the  game  <  V  U  lt/1.  >  U  |  A ' .  V  >  the  set  ! </ !  is  sullicient  lor  player  I  because  lor  each 
\  e  \ 

i  --  /  \  (<•,.«•  I  >  \t\.i  I  for  each  t  •  >  U  !ft|. 

By  IP. 4)  and  <Q4)  we  obtain  <  V  U  It/!.  )  U  |ft|.V>  •  /)  and 

(Q.9)  /I  V  u  !</!.)  u  ! />  1 . \  I  -  /(!<;!.  l  U  Ifti.vi 

It  is  easy  to  see  that  )  is  sullicient  for  player  II  in  the  game  <  V  U  '  ‘ .  )  U  !/>!.N  >  Hence, 

by  IP. 3)  and  (P.4):  <  V  U  jt/|,>.  \/>  <  I ).  and  then  by  (Q  5) 

(Q.l()>  /(  v  u  !<;!.>. U)  /(  v  u  !,/!.  >  u  ’,/>!. \  ). 

Now  /.  £  V/ and  then  by  (Q.2)  we  have 

<Q. 11)  /(  V  U  |</!.)./.)  J  /l  V  U  |t/!.).A/>. 

Combining  (Q.6I-IQ  1 1 )  we  obtain  /  <  \ .  > .  A  )  ?  /  I  bus.  we  have  prosed  that  MV.}, A)  ^  i 

for  each  <A’,),A>  (.  /.)  with  v(  A.  ).A  )  f  (  -«>.«o|  and  each  /  <  v(V.).A)  But  then 

(Q  12)  /  (  A.  ).k)  2  v(  \.). A  )  for  each  <  V.  >.A  >  <  /> 

It  follows  from  IQ. 3).  (Q  121  and  IP  3)  that 

(Q  1 3)  /(.V.  >,  A  )  =  -  / 1  ) . A.  A  I  <  v I  >.  V.A  )  -  \(  V  I  A  )  lor  each  <  V.  >  A  >  6  /). 

Properties  (Q  12)  and  I Q I  3 )  imply  the  conclusion  of  the  theorem 
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MANPOWER  MODELING  IN  COST  EFFECTIVENESS 
STUDIES  OF  USAF  PROGRAM  TO  REDUCE 
THE  INCIDENCE  OF  HEART  DISEASE* 

(  litVord  (  Petersen 

Purdue  l  ntwrsih 
lies /  I. a  tawin'.  Indiana 

ABSTRACT 

Planning  !<>i  a  cardiovascular  disease  reduction  program,  soon  to  he  initiat¬ 
ed  hv  the  1  tilled  Slates  Air  force,  has  required  an  e\alu.tin>n  ut  its  expected 
com  effectiveness  During  the  course  of  this  evaluation,  it  was  necessarv  to 
consider  manpower  Hows  and  their  expected  changes  in  response  to  the  disease 
reduction  program  1  his  paper  describes  several  manpower  models  that  were 
applied  .1  simple  expected  value  equilibrium  model,  a  cross-sectional  model 
that  considered  the  length  of  service  of  personnel,  and  a  stalling  model  used 
t<>  optimize  the  allocation  of  paramedics  to  the  many  Air  force  bases  of  various 
sizes  I  he  relevance  of  these  models  to  the  cost  effectiveness  evaluation  is 
shown  hut  the  detailed  cost  effectiveness  results  are  not  presented 

Analyses  are  being  performed  to  evaluate  the  cost  effectiveness  of  a  U  S.  Air  Force  health 
program  that  is  soon  to  be  initiated.  The  "Health  Evaluation  and  Risk  Tabulation"  (HEART) 
program  will  be  directed  toward  cardiovascular  disease  that  strikes  several  hundred  Air  Force 
personnel  annually  and  results  in  a  considerable  loss  of  personnel  through  death  and  disability. 

THE  HEART  PROGRAM 

In  \er>  general  terms,  the  HEART  program  will  involve  processing  all  military  personnel 
in  the  Air  Force  to  establish  each  individual's  risk  of  future  heart  disease,  followed  by  treat¬ 
ment  of  those  found  to  he  at  high  risk  This  will  be  done  by  measurement  of  systolic  blood 
pressure,  serum  cholesterol,  glucose  intolerance,  and  determining  heart  abnormality  (left  ven¬ 
tricular  hypertrophy)  by  means  of  an  electrocardiogram.  Also,  it  will  be  determined  whether 
the  individual  smokes  cigarettes  regularly.  These  data  and  age  are  used  with  the  risk 
coefficients  developed  through  the  Framingham  Study  [2]  to  calculate  for  the  individual  the 
probability  of  occurrence  of  a  cardiovascular  incident  within  eight  years.  The  coefficients  are 
based  on  over  20  years  of  followup  on  a  large  civilian  population,  and  have  succeeded  in  clus¬ 
tering  about  25  percent  of  the  heart  incidents  into  the  top  decile  of  risk.  The  possibility  of 
coefficient  modification  and  the  inclusion  of  other  risk  indicators  is  being  anticipated  in  the 
USAF  program. 

The  calculated  risks  will  serve  to  identify  the  most  susceptible  fraction  of  the  USAF  for 
treatment,  and  recalculation  after  treatment  will  serve,  in  some  measure,  to  show  the 

'Based  on  part  of  the  research  performed  for  the  l  VAI  School  of  Aerospace  Medicine  h\  Purdue  f  mversiiy  under 
Contract  I  .AlM>-77-(  '-0(04 
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improvement  that  was  obtained.  Obviously,  the  ultimate  benefit  will  become  apparent  only  in 
the  long  term  when  the  actual  incidence  of  heart  disease  can  be  observed.  In  addition  to  the 
treatment  of  high  risk  personnel,  all  personnel  will  be  reached  through  a  general  education  pro¬ 
gram  to  encourage  improved  dietary  habits  and  cessation  of  smoking. 

Various  analyses  are  being  directed  toward  therapy  effectiveness,  threshold  selection  poli¬ 
cies.  the  effect  of  measurement  error,  and  operational  procedures,  as  well  as  toward  the  evalua¬ 
tion  of  cost  effectiveness.  Statistical  and  probability  models  and  extensive  computer  simulation 
are  being  used.  This  paper,  however,  will  describe  only  the  application  of  manpower  planning 
models  to  the  determination  of  the  cost  effectiveness  of  the  III- ART  program.  The  population 
numbers  and  dollar  costs  that  will  be  used  herein  are  altered  and  somewhat  incomplete  but 
serve  for  illustrative  purposes',  the  actual  analysis  used  the  complete  and  most  recent  informa¬ 
tion  on  population,  turnover  of  personnel,  pay  scales,  and  policies.  The  complete  cost 
effectiveness  analysis  will  not  be  presented  as  it  is  only  intended  here  to  show  the  applicability 
of  several  manpower  planning  models  to  that  analysis. 


THF  COST  REDUCTION  PROBLEM 

Only  the  costs  to  the  U.S.  government  that  will  be  affected  by  the  HEART  program  need 
be  considered.  The  major  present  costs  that  will  be  changed  are  those  associated  with  USAF 
personnel  departing  from  service  and  their  subsequent  replacement.  It  is  necessary  to  identify 
and  associate  costs  with  the  various  ways  in  which  personnel  leave  the  Air  Force.  These  costs 
are  different  for  enlisted  personnel  and  officers  because  of  pay  scales,  and  different  for  flyers 
(pilots  and  navigators)  and  nonflyers  because  of  the  considerable  cost  of  training  a  replacement 
flyer.  An  additional  cost,  estimated  at  $1,000,000  per  year,  is  that  due  to  loss  of  aircraft 
because  of  heart  attacks  suffered  by  the  pilots. 

The  various  types  of  departure  will  now  be  described  briefly.  Voluntary  and  involuntary 
separation  (or  simply  separation)  includes  resignation,  failure  to  reenlist,  and  reduction-in-force 
terminations.  Except  in  the  case  of  flyers,  these  types  of  departure  are  considered  to  incur 
negligible  costs  Voluntary  retirement  (or  simply  retirement)  occurs  when  an  individual  retires 
with  from  20  to  30  years  of  service.  The  departure  cost  is  substantial,  including  payment  of  50 
to  7s  percent  of  the  individual's  salary  to  the  individual  or  his  spouse  for  a  period  usually  in 
excess  of  30  years.  Disability  retirement,  disability  separation,  and  assignment  to  the  temporary 
disability  retirement  list  (TDRL)  are  forms  of  departure  for  reason  of  30%  or  more  disability, 
and  must  be  considered  separately  for  cardiovascular  (CV)  disabilities  and  other  (non-CV)  disa¬ 
bilities  Because  cardiovascular  related  separations  and  TDRL's  practically  always  become  per¬ 
manent.  they  are  lumped  with  C'V  disability-retirements  in  this  analysis.  The  departure  cost  is 
substantial,  including  hopitalizalion  and  continuing  payments  to  the  individual  and  to  the 
spouse  over  a  period  usually  in  excess  of  30  years.  Departure  by  death  is  self-explanatory  and 
its  vost  is  analogous  to  that  for  disability  retirements. 

In  determining  the  cost  per  retirement,  disability  retirement,  or  death,  it  seems  reasonable 
that  the  long  series  of  benefits  paid  to  the  individual  or  spouse  should  be  discounted.  It  is 
perhaps  not  surprising  that  it  was  difficult  to  determine  what  rate  to  use,  and  that  the  agreed 
upon  approach  was  to  use  two  rates,  5  and  10  percent,  for  separate  analyses.  The  cost  of  CV 
departures  decreases  by  approximately  20%  when  changing  from  5%  to  1 0%  discounting. 
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Table  I  summarizes  the  approximate  costs  For  each  type  of  departure: 

TABLE  I.  Cost  of  Each  Departure  from  U.S.  Air  lone 
(  Thousands  of  Dollars) 


Officers 

Enlisted 

Personnel 

Separation 

Retirement 

C’V  Disability-Retirement 

CV  Death 

Non-C’V  Disability-Retirement 
Non-CV  Death 

Flyers 

280.0 

474.1 

531.9 

431.7 

449.8 
353.6 

Nonflyers 

0 

194.1 

251.0 

151.3 

155.8 

81.1 

All  Categories 

0 

109.9 

153.7 

85.3 

72.0 

32.4 

All  future  obligations  brought  to  present  worth  using  a  5  percent  discount  rate. 

Given  the  cost  for  each  departure,  and  knowing  the  present  average  number  of  CV 
disability-retirement  and  C'V  death  departures  over  the  past  several  years,  it  is  simple  to  calcu¬ 
late  the  annual  departure  cost  due  to  cardiovascular  disease.  The  anticipated  effectiveness  of 
therapy  in  reducing  C’V  incidence,  through  the  HEART  program,  can  then  be  assumed  (we've 
used  20  percent  here).  A  naive  approach  to  determining  the  cost  reduction  is  to  claim  20  per¬ 
cent  of  the  annual  CV  departure  cost  (from  which  the  operating  cost  of  the  HEART  program 
would  be  subtracted  to  obtain  the  net  annual  savings).  This  approach,  however,  neglects  the 
effect  of  the  reduction  in  CV  departures  upon  the  other  types  of  departure.  It  neglects,  for 
example,  the  possibility  that  a  person  saved  from  C'V  death  may  be  killed  in  another  way.  or 
that  he  must  ultimately  leave  in  some  manner,  typically  incurring  a  departure  cost.  Aiso 
ignored  is  the  beneficial  effect  of  the  HEART  program  in  delaying  the  occurrence  of  heart 
attacks  in  the  individuals  who  will  still  suffer  them. 

To  deal  with  the  interaction  between  the  various  types  of  departure,  two  different  models 
were  formulated.  Both  are  based  on  the  assumption  of  a  steady  state  manpower  system. 

THE  STEADY  STATE  SYSTEM 

Although  the  U.S.  Air  Force  will  probability  never  be  in  a  true  steady  state  condition,  it  is 
as  reasonable  to  use  such  a  condition  for  the  analysis  as  to  hypothesize  any  other  unknown 
future  state.  The  strategy  is  to  model  a  steady  state  force  having  the  same  size,  distribution  of 
personnel,  and  departure  rates  as  the  present  force,  and  then  to  hypothesize  a  20  percent  reduc¬ 
tion  in  the  cardiovascular  departure  rates  and  determine  what  the  new  steady  state  condition 
would  be.  The  difference  in  annual  departure  costs  associated  with  the  two  systems  would  be 
attributable  to  the  20  percent  reduction  in  CV  incidence.  Proportional  cost  cha  iges  would 
result  from  any  other  assumed  reduction  in  C’V  incidence. 

There  are  two  primary  requirements  that  must  be  satisfied  in  order  to  maintain  steady 
state  Obviously,  the  annual  number  of  new  entries  must  equal  the  annual  number  of  depar¬ 
tures  for  each  class  of  personnel.  In  addition,  for  each  class,  the  total  length  of  service  in  years 
of  all  persons  departing  in  one  year,  must  equal  the  number  of  persons  in  the  system  (a  conse¬ 
quence  of  N  man-years  of  service  being  accumulated  each  year  by  a  force  of  size  N). 

In  the  present  U.S.  Air  Force,  the  number  of  new  entrants  is  less  than  the  number  of 
departures  for  enlisted  men.  Also,  the  total  length  of  service  for  enlisted  men  departing  per 
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year  exceeds  the  number  of  this  class  of  personnel,  and  confirms  a  shrinking  force  with  a  large 
fraction  of  the  population  having  many  years  of  service.  The  steady  state  turnover  rate  indi¬ 
cated  by  length  of  service  of  those  departing  (reciprocal  of  average  length  of  service)  is  some¬ 
what  below  the  actual  turnover  rate  being  experienced.  In  contrast,  the  situation  is  reversed  for 
enlisted  women.  Nevertheless,  with  certain  assumptions  as  to  future  recruitment  and  incen¬ 
tives  it  seems  reasonable  to  conceive  of  a  model  of  the  USAF  at  steady  state. 

EXPECTED  VAI.l'E  EQUILIBRIUM  MODEL 

1  his  model  for  adjusting  the  other  departure  rates  as  the  C’V  departure  rales  decrease  is 
simple  and  requires  little  data.  It  ignores  the  length  of  service  requirement  for  steady  state, 
simply  assuming  that  it  will  be  met. 

The  initial  steady  state  How-  is  illustrated  for  enlisted  men  in  Figure  1  and  requires  that 
we  know  only  the  steady  state  total  population  count  which  will  remain  constant,  the  fractions 
lor  the  various  types  of  departure,  and  consequently  the  fraction  remaining  active  each  year. 
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The  C'V  departure  rates  arc  then  considered  as  being  reduced  by  20  percent.  It  is 
assumed  that  the  manpower  flow  that  would  have  departed  due  to  CV  disease  is  diverted  to  the 
other  types  of  departure,  and  to  continuing  service,  in  proportion  to  their  respective  rates.  A 
rationalization  of  this  i.ssumption  is  possible  by  viewing  the  departure  fractions  or  rates  as  pro¬ 
babilities.  Individuals  who  win  a  reprieve  from  C'V  disease,  and  will  have  to  be  routed  to  other 
types  of  departure  or  to  continuing  service,  are  distributed  according  to  the  appropriate  proba¬ 
bilities.  At  steady  state,  with  reduced  C'V  departure  rates,  we  recalculate  new  rates  for  other 
types  of  departure  and  extend  them  to  numbers  of  departures  as  shown  in  Figure  2. 

Flows  are  adjusted  for  the  other  classes  of  personnel  in  a  similar  manner.  For  pilots  and 
navigators,  however,  there  is  a  large  replacement  training  cost  for  the  increased  voluntary  and 
involuntary  separations. 
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I  VV  i)-CH  ARACTERISTIC  CROSS-SECTIONAL  MODEL 

This  model  III  was  chosen  to  impose  the  effect  of  length  of  service  on  the  analysis.  This 
effect  is  important  because  it  is  presumed  that  the  effect  of  implementing  the  HHART  program 
will  be  not  only  lo  reduce  the  rate  of  CV  departures  but  to  postpone  the  time  of  departure  of 
the  fraction  who  will  still  depart  because  of  cardiovascular  reasons.  The  data  requirements  are 
reasonable,  not  requiring  detailed  tracking  of  cohorts,  but  primarily  adding  data  regarding  the 
average  length  of  service  at  departure. 

In  this  model  we  define  a  matrix,  P.  of  one-step  transition  probabilities,  where  each  state 
is  described  by  two  characteristics,  a  status  and  a  length  of  service.  The  model  has  more  capa¬ 
bility  than  will  be  used,  as  it  serves  our  need  by  defining  only  one  status,  namely  "active," 
rather  than  various  ranks,  for  example.  The  analysis  will  be  performed  separately  for  each  class 
of  personnel  and  we  will  assume  no  How  of  personnel  between  classes,  such  as  from  enlisted  to 
officer  or  vice  versa. 

(iiven  the  matrix  P.  completely  defined  by  knowing  the  average  length  of  service  at  time 
of  departure  and  the  fraction  of  total  departures  for  each  type  of  departure,  we  note  that  the 
limit  P"  will  be  the  steady  state  transition  matrix.  This  matrix  will  have  identical  rows,  II, 

where  the  /  th  element,  it,,  is  the  proportion  of  the  population  in  state  j  at  equilibrium.  The 
vector  II  is  determined  by  solving 

II  =  IIP 

2>,=  i- 

Ml, 


The  strategy  will  be  to  find  the  steady  state  departures  for  our  initial  data,  then  to  reduce 
the  CV  departure  probabilities  by  20  percent  and  increase  the  length  of  service  to  CV  departure 
by  an  estimated  two  years,  and  again  find  the  steady  state  condition.  Considering  the  cost  of 
each  type  of  departure,  the  annual  savings  in  departure  costs  due  to  the  effect  of  the  HEART 
program  will  then  be  calculated. 

EXAMPLE.  Use  of  the  P  matrix  will  be  demonstrated  with  a  very  small  example.  Fol¬ 
lowing  this  the  enlisted  personnel  will  be  analyzed  to  show  some  of  the  adjustment  that  had  to 
be  made  in  our  assumptions,  and  to  give  results  for  comparison  with  those  of  the  expected 
value  equilibrium  model. 


Suppose  there  is  an  organization  with  one  class  of  personnel  and  three  types  of  departure. 

Each  year  from  now  on  five  persons  will  resign  after  two  years  of  service,  five  will  be  disabled 
after  three  years  of  service,  and  fifteen  will  retire  after  five  years  of  service.  Since  the  total 

length  of  service  of  the  departing  personnel  is  100  man-years,  the  size  of  the  organization  must  I 

be  100  at  steady  state.  All  departing  personnel  are  immediately  replaced.  We  wish  to  find  the 
steady  state  distribution  of  personnel  by  length  of  service. 


Each  year  25  persons  enter  the  system  and  each  year's  group  behaves  as  follows: 


End  of  Year 

1 

2 

3 

4 

5 


Fraction  Remaining 
For  the  Next  Year 
1.0 
0.8 
0.6 
0.6 
0 
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Defining  active  duty  states  (,4,  n),  where  n  is  the  number  of  years  of  completed  service, 
the  one-step  transition  matrix,  P,  is  as  shown  below.  The  probabilities  of  changing  in  one  year 
from  state  A.  i  to  state  A,  i  +1  for  /  =  0,  I.  2,  3,  4  are  obtained  from  the  table  of  fraction 
remaining,  shown  above.  The  probabilities  of  changing  from  state  A,  i  to  state  A,  0  are  proba¬ 
bilities  of  leaving  the  system,  in  which  event  a  new  person  enters  the  system  with  0  years  of 
service. 


To: 


.4,  0  .4,  1 

A ,  2 

A ,  3  .4,  4 

A ,  0 

1. 

A,  1 

0.2 

0  8 

From:  A ,  2 

0.25 

0.75* 

A ,  3 

1. 

A,  4i 

1. 

‘Fraction  going  from  A, 2  to 
A, 3  =  0.6/0. 8  =  .75 


Solving  II  =  IIFand  £  it,  —  1,  we  obtain  fl  =  (0.25, 0.25.  0.20. 0. 1 5, 0. 1 5J . 

All, 

The  interpretation  of  this  solution  is  that  for  this  organization,  there  will  be  25  percent 
new  personnel,  25  percent  who  have  completed  one  year  of  service,  20  percent  who  have  com¬ 
pleted  two  years,  15  percent  who  have  completed  three  years,  and  15  percent  who  have  com¬ 
pleted  four  years  of  service  and  who  will  retire  at  year  end. 

If  the  P  matrix  and  its  resulting  equilibrium  distribution  n  are  judged  applicable,  that  is,  if 
the  mechanisms  underlying  the  departures  are  such  that  the  numbers  of  departures  of  the 
different  types  stay  in  the  same  relative  proportions,  then  for  any  size  organization  the  numbers 
of  departures  at  steady  state  can  easily  be  derived. 

To  conclude  this  example,  assume  that  the  desired  size  of  the  organization  is  120,  and 
that  II  =  [0.25, 0.25, 0.20, 0. 1 5. 0. 1 5]  still  applies  as  the  distribution  of  length  of  service.  From 
7r n  we  know  that  25%  of  the  organization  (30  persons)  will  depart  each  year.  These  departures 
are  then  prorated  over  the  types  of  departure  as 

Resignations  =  30  x  5/25  =  6 

Disabilities  =  30  x  5/25  =  6 

Retirements  =  30  x  15/25  =  18 


Enlisted  Personnel  — Initial 

The  force  size  for  enlisted  personnel  is  500,000.  However,  the  presently  observed 
numbers  of  departures  from  the  Air  Force  as  shown  in  Table  2(a)  are  not  consistent  with  a 
steady  state  model  with  500,000  population.  In  fact,  they  imply  an  equilibrium  population  of 
545,450.  One  way  to  retain  our  observed  departure  information  in  a  steady  state  model  having 
a  population  of  500,000  is  to  decrease  the  number  of  each  type  of  departure  to 
500,000/545,000  of  its  observed  value.  This  is  based  on  the  equilibrium  requirement  that: 
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TABI.f.  2.  Calculation  of  Unlisted  Steady  State  Departures 
Before  HEART  Program 


Average  Length 
of  Service. 

■  (.it  (Years) 

Observed 

Number 

Departing 

Separation  4 

Retirement  22 

(A  Disability -Retirement  20 

(  A  Deaths  10 

Non-CA  Disability -Retirement  6 

\on-( A  Deaths  7 

i  Implied  steady  state  population:  545,450) 

50.000 

15.000 

100 

50 

1,500 

500 

67.150 

<  b)  1  he  annual  number  of  departures  for  a  force  of  500.000  at  steady 
state,  with  proportional  scaling,  are: 

Separation 

45,833.71 

Retirement 

13.750.12 

CV  Disability-Retirement 

91.67 

CV  Death 

45.83 

Non-CV  Disability -Retirement 

1.375.01 

Non-CA'  Death 

458.34 

61,554,68 

id  The  annual  number  of  departures  for  a  force  of  500.000  at  steady 
state,  with  selective  settling,  are: 

Separation 

47.000. 

Retirement 

13,480. 

(  V  Disability-Retirement 

100. 

CV  Death 

50. 

Non-CV  Disability-Retirement 

1.500. 

Non-CV  Death 

500. 

L  . . 

62.630. 

where  n  is  Ihe  number  of  departures  of  the  /  ih  type,  y  is  their  average  length  of  service,  and 
A  is  the  total  population  Because  all  s,  remain  the  same,  a  change  in  ,V  can  be  accommodated 
by  changing  all  u,  proportionally  as  shown  in  Table  2(b).  Proportional  scaling,  as  just 
described,  seems  valid  in  making  small  adjustments,  but  for  large  adjustments  such  as  this  the 
reasonableness  of  the  effect  on  each  type  of  departure  deserves  examination. 

Another  way  to  construct  the  steady  state  model  is  to  decrease  the  numbers  of  departures 
selectively.  The  data  on  the  number  of  disability-retirements  and  deaths,  whether  from  C'V 
disease  or  other  causes,  as  observed  in  a  present  force  of  500,000  enlisted  personnel  should  not 
be  treated  cavalierly  .  They  should  not  be  adjusted  appreciably  in  the  initial  steady  slate  model 
because  there  is  no  logical  basis  for  reducing  the  incidence  in  contradiction  to  the  medical 
records.  The  types  of  departures  that  can  logically  be  reduced  to  achieve  a  hypothesized  steady 
state  Air  force,  are  separations  and  retirements,  assuming  that  Air  f  orce  inducements  and  poli¬ 
cies  were  modified  to  effect  such  reductions.  A  reasonable  assumption  is  that  the  annual 
number  of  separations  can  be  reduced  about  6  percent  (from  50.000  to  47,000).  Leaving  the 
number  of  departures  for  disability-retirement  and  death  unaltered,  we  derive  the  required 
annual  number  of  retirements,  u, .  from 
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47.000(4)  -f  n,  (22)  ♦  IC0I20)  *  SOHO)  +  1,500(6)  +  500(7)  =  500. 000  obtaining 

13.480  retirements  annually  'ne  steady  state  results  are  shown  in  Table  2(c).  These  results 
will  be  used  to  represent  the  initial  departure  distribution  of  enlisted  personnel,  before  installa¬ 
tion  of  the  IIMAR  I  program 

Knlisted  Personnel  — After 

It  was  shown,  in  the  example  presented  earlier,  that  if  given  an  initial  set  of  data  compris¬ 
ing  the  annual  number  of  each  type  of  departure  and  the  average  age  at  departure,  the  steady 
stale  si/e  of  the  population  can  be  calculated  Also,  an  equilibrium  distribution.  II.  can  be  cal¬ 
culated  to  describe  the  distribution  by  length  of  service  <as  well  as  the  annual  number  of  each 
type  of  departure). 

If  the  initial  set  of  data  is  perturbed,  a  new  population  si/e  and  new  numbers  of  each  type 
of  departure  can  be  calculated  for  steady  state.  The  desired  population  si/e  can  be  restored  by 
proportional  or  selective  scaling. 

The  perturbation  applied  to  the  initial  steady  state  data  lor  enlisted  personnel  is  the 
assumed  effect  of  the  IIMART  program,  that  is,  a  reduction  of  CV  departures  by  2<*  percent  and 
tin  increase  of  two  y  ears  in  the  average  age  of  those  departing  because  of  CV  disease  I  able 
3(a)  shows  the  steady  state  result  for  these  assumptions,  and  tin  implied  population  of  499.660 
The  population  was  restored  to  500, 000  by  proportional  scaling.  Table  3(b)  shows  the  result, 
after  scaling,  and  the  change  from  the  "before  IIMART  program”  result  of  Table  2(c) 


TABLM  3.  Calculation  of  Enlisted  Steady  State  Departures 
After  HEART  Program 


(a) 

Average  Length 
of  Service 
(Years) 

Assumed 

Number 

Departing 

Separation 

4 

47,000.00 

Retirement 

22 

13.480.00 

CV  Disability-Retirement 

22 

80.00 

CV  Death 

21 

40.00 

Non-CV  Disability-Retirement 

6 

1.500.00 

Non-CV  Death 

7 

500.00 

(Implied  steady  state  population 

499.660) 

62.600.00 

<b)  The  annual  number  of  and  changes  in  departures  for  a  force 
of  500.000  til  steady  state,  after  proportional  scaling,  tire: 


Change  in  Unlisted 


Number 

Departing 

Personnel  Departing 
Due  to  IIMART  Program 

Separation 

47.031.96 

+  31.96 

1  Retirement 

13.489.17 

+  9.17 

CV  Disability-Retirement 

80.06 

-  19.94 

CV  Death 

40.03 

'9.97 

Non-CV  Disability- Retirement 

1.501  02 

+  1.02 

Non-CV  Death 

500.35 

62,642  59 

+  0.35 
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Using  the  departure  changes  shown  in  Table  3(b)  and  the  departure  costs  of  Table  1,  the 
annual  change  in  departure  costs  of  enlisted  personnel  due  to  the  HEART  program  is  a 
decrease  of  S2, 822, 656  per  year.  This  model  realistically  yields  a  larger  increase  in  retirements 
than  shown  by  the  expected  value  equilibrium  model  (Figure  2),  thereby  accounting  for  most 
of  the  reduction  in  savings  (vs.  $3,820,638). 

Annual  reductions  for  other  classes  of  personnel,  and  for  other  assumptions  of 
effectiveness  of  the  HKART  program,  are  generated  in  a  similar  way. 

ALLOCATIONS  OF  PARAMEDICS 


One  of  the  primary  increased  costs  of  the  HEART  program  is  that  of  additional  personnel 
needed  to  operate  the  program.'  A  problem  arises  in  the  efficient  allocation  of  numbers  of 
paramedics  to  the  various  USAF  bases  while  recognizing  that  the  bases  are  of  different  sizes. 

Knowing  the  number  of  military  personnel  of  each  base,  and  assuming  a  risk  threshold 
that  is  consistently  used  at  all  bases  and  that  will  place  an  identical  fraction  of  each  base's  popu¬ 
lation  under  treatment,  we  can  define  the  following: 


.V  =  the  specified  fraction  of  base  population  that  is 
to  be  treated  in  the  therapy  group. 

0,  -  the  number  of  paramedics  required  at  the  i  th  base. 
0,  =  the  population  of  the  i  th  base. 


Knowing  the  details  of  the  proposed  treatment  and  screening  tasks,  and  that  there  is  one  nurse 
available  part-time  at  each  base,  we  determined  x, ,  the  capacity,  or  maximum  fraction  of  the 
base  population  that  can  be  treated,  as  a  function  of  base  size  and  number  of  paramedics  allo¬ 
cated.  This  involved  careful  analysis  of  the  time  required  for  each  task  as  well  as  consideration 
of  allowances  for  rest  breaks  and  vacations  The  resulting  capacity  for  the  i  th  base  is  deter¬ 
mined  as: 

Hours  for  Screening  +  Hours  for  Therapy  +  Hours  for  Group  Sessions  = 

Available  Hours  of  Nurse  +  Paramedics 

19803(0,)  +  7.875  ( 0, )  (x, )  +  1050  =  1800  +  1800(0) 

1800(0)  -  .19803(0,)  +  750 
Wr  X=maX’ - 7.87570)  - ’° 


Nonnegativity  must  be  enforced  explicitly.  Some  of  the  effort  of  the  nurse  and  paramedics 
involves  screening  of  all  base  personnel  and  it  would  be  possible  to  obtain  a  negative  value  for 
v  (the  fraction  that  can  be  treated)  if  the  screening  effort  exceeded  the  available  manpower. 

The  objective  is  to  determine  the  minimal  set  of  0,  such  that  x,  ^  X  for  all  i.  We  will 
first  formulate  a  simple  mathematical  programming  approach  for  determining  0,  and  then  use  il 
to  justify  an  even  simpler  computerized  allocation  scheme. 
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Mathematical  Programming  Model 

All  USAE  bases  may  be  grouped  inlo  26  si/e  ranges,  and  we  will  define  A,  as  Ihe  number 
of  bases  of  the  i  th  size,  i  —  1.2 . 26.  We  wish  to  determine  /', ,  the  number  of  paramed¬ 

ics  to  assign  to  all  bases  of  size  /,  for  any  value  of  .V  that  is  chosen.  The  integer  linear  program 
is: 


Min  £  A, /'  subject  to  twenty-six 


constraints,  one  for  each  base  size,  of  the  form 
7.875</f,  )  (A  )  +  .  198031#, )  -  750 


where  the  variables  /’,  are  nonnegative  integers. 

Each  constraint  is  a  function  of  only  one  variable,  B,,  since  A'  is  fixed  and  B,  is  known. 
Solution  of  such  a  program  to  determine  each  B,  and  to  minimize  the  total  required  number  of 
paramedics  would  be  possible  but  very  time  consuming.  As  an  alternate  method,  note  that 
each  constraint  may  be  satisfied  by  merely  fixing  B,  as  the  smallest  feasible  nonnegative  integer. 
This  will  obviously  minimize  the  objective  function  because  minimizing  each  term  of  a  sum 
minimizes  the  sum. 

An  optimal  assignment  will  not  necessarily  produce  full  utilization  of  all  paramedics,  but 
there  will  be  no  assignment  using  fewer  paramedics  which  will  permit  treatment  of  the  slated 
fraction  <X)  of  the  population. 

Simple  Allocation  Algorithm 

The  allocation  algorithm  starts  with  a  specified  value  of  A’,  and  considers  only  the  nurse 
assigned  to  each  base.  The  maximum  possible  therapy  group  size,  v, ,  with  full  utilization  of 
this  allocation  is  then  calculated  and  updated  for  each  base.  If  \  >  .V  for  all  /,  this  allocation  is 
optimal  for  the  stated  A.  Otherwise  the  base  for  bases)  with  the  smallest  fraction  of  personnel 
in  therapy  f.v,)  is  then  "given"  one  paramedic,  and  the  calculations  arc  updated.  This  procedure 
is  continued,  assigning  additional  paramedics,  until  the  desired  therapy  group  fraction  is 
attained  for  all  bases.  In  summary,  the  procedure  initializes  the  B,  vector  at  0  and  determines 
.V,  which,  because  of  a  uniform  threshold  policy  at  all  bases,  will  be  the  smallest  fraction  among 
all  bases.  Then  the  B,  vector  is  increased  in  the  most  efficient  manner  until  the  specified  value 
of  A'  is  attained. 

The  procedure  is  easily  continued  to  obtain  solutions  for  an  entire  range  of  A  values.  A 
typical  set  of  solutions  shows  the  total  number  of  paramedics  required  to  range  from  254  for  a  7 
percent  therapy  group  to  567  for  a  19  percent  group  lor  the  7  percent  therapy  group,  the  indi¬ 
vidual  base  requirements  range  from  0  to  5  paramedics  and  for  the  19  percent  group,  from  0  to 
1 1  Overall  utilization  for  the  two  cases  is  .75  and  88.  respectively 

TOTAL  LOST  EFFECTIVENESS 

The  total  cost  effectiveness  was  expressed  as  a  net  annual  savings  and  was  a  function  of 
the  risk  threshold  selected  (which,  in  turn,  governed  how  many  people  would  be  treated)  and 
the  assumptions  made  regarding  the  effectiveness  of  therapy 


cc  i*m i  rsi  n 


V 


1 68 


Net  annual  savings  =  Departure  cost  reduction  +  lost 
aircraft  cost  reduction  +  cost 
reduction  in  CV  nondepartures’  — 
paramedic  costs  —  operating,  drug, 
and  test  costs. 

It  has  not  been  the  intent  of  this  paper  to  present  the  results  of  the  cost  effectiveness 
analysis  but  only  to  describe  several  manpower  planning  models  used  in  performing  it.  The 
models  permit  estimation  of  changes  of  some  of  the  complex  cost  elements.  Computer  experi¬ 
mentation  was  then  possible  to  aid  in  certain  decisions  such  as  determination  of  therapy  group 
size  and  treatment  intensity  PI 
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'Hospitalization  and  noneffectivcnc.vs  costs  of  the  personnel  suffering  mild  C  V  disease  which  does  not  result  in 
disability-retirement  or  death  will  also  he  reduced  by  an  effective  I  lb  ART  program 
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ABSTRACT 

I  his  paper  describes  an  empirical  evaluation  of  several  approximations  to 
Hadley  and  Whitin's  approximate  continuous  review  inventory  model  with 
backorders  It  is  assumed  that  lead  lime  demand  is  normally  distributed  and 
various  exponential  functions  are  used  to  approximate  the  upper  tail  of  this  dis¬ 
tribution  These  approximations  offer  two  important  advantages  in  computing 
reorder  points  and  reorder  quantities.  One  advantage  is  that  normal  tables  are 
no  longer  required  to  obtain  solutions,  and  a  second  advantage  is  that  solutions 
may  he  obtained  directly  rather  than  iteratively.  These  approximations  are 
evaluated  on  two  distinct  inventory  systems  It  is  shown  that  an  increase  in 
average  annual  cost  of  less  that  1%  is  expected  as  a  result  of  using  these  ap¬ 
proximations  The  only  exception  to  this  statement  is  with  inventory  systems 
in  which  a  high  shortage  cost  is  specified  and  ordering  costs  are  unusually  low. 


INTRODUCTION 

This  paper  is  concerned  with  Hadley  and  Whitin’s  [4]  approximate  continuous  review 
inventory  model  in  which  a  fixed  quantity  of  an  individual  item  is  ordered  each  item  the  inven¬ 
tory  position  (units  on  hand  plus  units  on  order  minus  backorders)  reaches  the  reorder  point. 
After  a  lead  time  has  elapsed,  the  entire  order  is  received.  It  is  assumed  that  reorder  quantities 
and  reorder  points  are  established  independently  for  each  item  and  that  the  distribution  of  lead 
time  demand  can  be  approximated  by  a  normal  distribution. 

The  optimal  reorder  point  and  reorder  quantity  for  this  model  are  determined  by  minimiz¬ 
ing  a  cost  function  including  the  expected  number  of  orders  placed  per  unit  time,  the  expected 
number  of  backorders  per  unit  time,  and  an  approximation  to  the  expected  holding  cost  per 
unit  time.  The  solution  which  minimizes  this  approximate  cost  function  is  found  by  an  itera¬ 
tive  algorithm  that  converges  quite  rapidly. 

To  find  the  optimal  solution,  it  is  necessary  to  calculate  the  expected  number  of  backord¬ 
ers  per  period  for  a  given  policy.  If  lead  time  demand  is  assumed  to  be  normally  distributed, 
then  this  requires  the  evaluation  of  the  standardized  normal  loss  integral.  Several  authors 
[5,7,8]  have  developed  exponential  functions  to  approximate  the  expected  number  of  backord¬ 
ers  per  period.  This  not  only  alleviates  the  iterative  solution  but  saves  the  table  look-up 
required  to  evaluate  the  normal  loss  integral  This  paper  evaluates  these  approximations. 


1 


170 


D  i  in  km  1 1 


MOD F.l.  DKRIVATION 

The  following  notation,  from  Hadley  and  Whitin  [4]  is  used. 


Q 

/■ 

A 

I 

( 

/ 

JT 

a 


P 

hU) 

M 

<T 

H(r) 

n(r) 


—  order  quantity  (units) 

=  reorder  point  (units) 

=  demand  rate  (units/year) 

=  ordering  eost  (S/order) 

=  eost  of  item  (S/unit) 

=  carrying  cost  (S/S  value  of  stock/year) 

=  backordering  cost  (S/unit  backordered) 

=  probability  of  a  stockout  occurring  during 
a  lead  time 

=  probability  that  any  unit  demanded  cannot  be 
filled  from  stock 

=  probability  density  function  of  lead  time  demand 
=  mean  lead  time  demand  (units) 

=  standard  deviation  of  lead  time  demand 
=  probability  lead  time  demand  exceeds  the  reorder 
point  (complementary  cumulative  distribution) 

=  expected  number  of  backorders  during  a  lead  time  when 
the  reorder  point  is  r  (units) 


The  expected  anual  cost  (AC)  of  operating  the  inventory  system  is  represented  in  the 
equation  below,  assuming  that  shortages  are  backordered. 

( 1 )  AC  =  +  /(  J-^  +  r  —  /uj  +  n  ( r ) . 

The  first  term  represents  the  expected  ordering  cost,  the  second  term  the  expected  carrying 
cost,  and  the  third  term  the  expected  number  of  backorders.  It  is  the  second  term  in  this  equa¬ 
tion  that  is  an  approximation,  since  the  average  inventory  level  is  estimated  as  though  there  are 
no  backorders.  If  the  expected  number  of  backorders  is  small,  the  approximation  is  very  good 
(see  Gross  and  Ince  [3]).  The  third  term  may  also  be  considered  an  approximation  since  the 
lead  time  demand  is  approximated  by  the  normal  distribution. 


The  values  of  Q  and  r  that  minimize  the  above  annual  cost  function  can  be  found  by  the 
simultaneous  solution  of  the  two  equations  below. 


(2) 

H 

—  (A 
1C 

(3) 

Hir)  = 

QIC 

1T\ 

An  iterative  solution  is  suggested  by  Hadley  and  Whilin  [4)  that  will  work  as  long  as  <  1. 

ttK 

This  is  fine,  since  as  H(r)  approaches  1,  the  approximation  to  carrying  cost  becomes  rather 
poor  and,  thus,  the  model  is  not  appropriate. 


In  practice,  it  is  often  difficult  to  estimate  the  backordering  cost,  n.  To  avoid  this  prob¬ 
lem,  one  may  instead  specify  a  desired  service  level.  One  approach  (a  service  policy)  is  to 
specify  a,  the  probability  of  a  stockout  occurring  during  a  lead  time.  A  second  approach  (/3  ser¬ 
vice  policy)  is  to  specify/),  the  probability  that  any  unit  demanded  cannot  be  filled  from  stock. 
Since  v  is  not  specified,  it  can  be  eliminated  from  (2)  and  (3)  above  (see  Nahmias  [6]),  yield¬ 
ing  the  following: 
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Fi(r) 

nit) 

+  2k  A 

//  (  r ) 

HU) 

1C 

In  addition,  then  service  policy  requires 


(5) 


Hir)  =  «. 


And.  the  ii  service  policy  requires 


(6) 


nir) 

Q 


-0- 


Values  of  Q  and  r  may  be  found  directly  from  liquations  (4)  and  (S)  for  the  «  service  policy 
and  values  of  Q  and  r  may  be  found  iteratively  from  Equations  (4)  and  (6)  for  the  0  service 
policy.  See  Nahmias  |6|  for  an  appropriate  algorithm  for  finding  optimal  values  of  Q  and  r. 


FURTHER  APPROXIMATIONS 


When  applying  the  above  model  to  an  inventory  system  with  many  parts,  it  is  typically 
assumed  as  an  approximation  that  the  lead  time  demand  follows  a  particular  distributional  form 
for  all  parts.  A  very  convenient  approximation  and  the  one  assumed  in  this  paper  is  the  normal 
distribution.  That  is,  it  is  assumed  that  //(r)  is  the  probability  density  function  of  the  normal 
distribution  with  the  mean  u  and  standard  deviation  <t.  Therefore,  H(r)  and  n(r)  may  be  cal¬ 
culated  from  (he  equations  below,  where  Z(r)  is  the  probability  density  function  of  the  unit 
normal. 

(7)  //(a)  =  ZU)  ill. 

J  r__  U 

<T 

( 8 )  Ti  ( r )  ~  <r  f 

J  r_  u 
tr 

The  integral  in  (7)  is  the  complementary  cumulative  distribution  of  the  unit  normal  and  is 
tabulated  in  any  standard  statistics  book.  The  integral  in  (8)  is  referred  to  as  the  standardized 
normal  loss  integral  and  is  tabulated  in  Brown  [2],  The  tabulated  integrals  in  (7)  and  (8)  are 
required  to  solve  Equations  (2)  and  (3),  Equations  (4)  and  (5),  and  Equations  (4)  and  (6). 

Two  approximations  have  been  suggested  to  avoid  the  table  look-up  required  by  (7)  and 
(8).  One  approximation,  suggested  by  Schroeder  [8]  and  Herron  [5],  is  to  use  an  exponential 
function,  of  the  form  ae  h',  to  approximate  the  integral  in  Equation  (7).  Using  this  approxima¬ 
tion, 

h\-  "I 

Hir)  =  ae  r  and  Jiir)  =  e 

n 


r-u 

<r 


ZU)  ill. 


The  exponential  approximation  not  only  avoids  the  table  look-up  required  to  calculate 
H(r)  and  n(r)  but  also  avoids  the  iterative  solution  procedures  required  to  find  optimal  values 
of  Q  and  r.  If  the  expression  for  /7(a)  is  substituted  into  the  annual  cost  Equation  (1)  and  par¬ 
tial  derivatives  are  set  equal  to  zero,  the  optima!  value  of  Q  is  as  follows  regardless  of  whether 
it,  a,  or  0  is  specified. 


(9) 


+ 


2A  A  '' 
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The  optimal  value  of  r  is  presented  in  Equation  (10a),  (10b),  and  ( 1 0c)  for  7r  specified,  a 
specified,  and  fi  specified,  respectively. 


(10a) 

(T  . 

r  =  u - -  In 

Q!C 

b 

■rrka 

(10b) 

r  =  u  —  In  (a/ a ). 

b 

(10c) 

tT  i 

r  =  u - -  In 

Ml 

b 

(TO 

Note  that 

the  optimal  values  of  Q  and 

o  not  require  an  iterative  solution.  These  values 
represent  approximate  solutions  when  h(t)  is  assumed  to  be  the  normal  probability  density 
function.  They  are  approximate,  since  the  complementary  cumulative  distribution  function  of 
the  unit  normal  is  approximated  using  an  exponential  function. 

A  second  approximation,  suggested  by  Herron  [5]  and  Parker  [7]  is  to  use  an  exponential 
function  of  the  same  form  to  approximate  the  integral  in  Equation  (8).  Using  this  approxima¬ 


tion,  n(r)  =  true 


and  H(r)  =  abe 


Likewise,  this  approximation  avoids  the  table  look-up  required  to  find  H(r)  and  n(r)  and 
avoids  the  iterative  solution  procedure  required  to  find  optimal  values  of  Q  and  r.  The  optimal 
value  of  Q  is  the  same  as  that  specified  in  Equation  (9)  and  the  optimal  value  of  r  is  presented 
in  Equations  (I  la),  (11b),  and  (11c)  for  tr  specified,  a  specified,  and /3  specified  respectively. 


( l  la) 

r  =  M  " 

(lib) 

r  =  M  " 

(11c) 

r  =  M  - 

IT 

b 

IT 

b 

IT 

b 


ML 


T TkClb 


M 

era 


Thus,  both  approximations  allow  optimal  values  of  £?  and  r  to  be  calculated  directly  and 
avoid  the  problems  of  looking-up  data  in  the  normal  tables.  The  purpose  of  this  paper  is  to 
evaluate  the  accuracy  of  these  approximations. 

PARAMETER  ESTIMATION 

Figure  I  contains  a  plot  of  the  log  of  the  complementary  cumulative  unit  normal  distribu¬ 
tion  and  the  log  of  the  standardized  normal  loss  integral  versus  K,  the  number  of  standard 
deviations  above  zero.  For  the  exponential  functions  to  be  a  good  fit.  these  plots  should  be 
straight  lines.  Obviously,  there  is  a  rather  slow  gradual  curvature  to  both  lines  but  a  straight 
line  does  not  appear  to  be  a  bad  approximation. 

Table  I  contains  the  parameter  estimates  obtained  by  the  various  authors  and  the  range  of 
A'  that  was  used  to  obtain  these  estimates.  Herron  used  two  straight  lines  to  obtain  a  better  fit 
of  the  curvaturd. 

This  author  developed  his  own  parameter  estimates  for  the  standardized  normal  loss 
integral  by  fitting  a  least  square  regression  line  to  twenty-one  points  in  the  range  1.0  ^  K  ^ 
3.0.  This  was  done  in  order  to  evaluate  a  method  analogous  to  that  used  by  Schroeder  [81  for 
the  second  type  of  approximation 
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TABLE  I.  Parameter  Estimates 


Author  Ireferencel 

Estimates 

Range  of 
fit 

a  b 

A.  Approximations 

1  Schroeder  |8) 

2.  Herron  [51 

o  complementary  cumulative  distrib 

2.8800  2.4900 

.5500  .7530 

3.6000  .3870 

ution  of  unit  normal 

1.0  <  K  <  3.0 

0  ^  K  <  1.5 
1.5  <  K  <  3.0 

B.  Approximation  U 

1.  Byrkett 

2.  Herron  (51 

3.  Parker  [71 

standardized  normal  loss  integral 

1.5792  2.6879 

.4400  .5760 

2.4900  .3460 

.4500  1.6949 

1.0  $  K  ^  3.0 

0  s$  K  <  1.5 
1.5  <  A"  <  3.0 

0  <  K  ^  1.4 

The  reader  should  notice  that  Parker  [7]  developed  his  approximation  in  the  range  0  <  K 
<  1.4.  It  is  felt  that  for  most  inventory  systems,  including  those  evaluated  in  this  paper,  it  is 
preferable  to  use  an  approximation  of  the  upper  tail  of  the  distribution,  for  example  1.0  <  K  ^ 
3.0.  For  this  reason,  Parker's  approximation  will  not  be  given  further  evaluation. 

COMPARISON  WITH  TABLED  VALUES 

One  approach  to  measuring  the  accuracy  of  the  approximates  outlined  in  the  previous  sec¬ 
tion  is  to  compare  the  values  obtained  using  the  approximations  with  the  corresponding  tabled 
values.  Table  2  displays  these  results.  The  approximations  to  the  complementary  cumulative 
distribution  were  used  to  compute  tabled  values  of  the  complementary  cumulative  distribution 
at  intervals  of  .05  in  the  range  1.0  <  K  ^  3.0.  Likewise,  the  approximations  to  the  standard¬ 
ized  normal  loss  integral  were  used  to  compute  tabled  values  of  the  standardized  normal  loss 
integral.  Three  criteria  are  used  to  compare  the  approximations  to  the  tabled  values;  the  mean 
absolute  deviation,  the  mean  squared  deviation,  and  the  mean  percentage  deviation. 


TABLE  2.  Comparison  of  Approximations 
to  Tabled  Values* 


Author 

Absolute  deviation 

— 

Squared  deviation 

Percentage  deviation 

Mean  Max. 

Mean  Max. 

Mean 

Max. 

A.  Approximations  to  complementary  cumulative  distribution  of  unit  normal 

1 .  Schroeder  [81 

.0090  .0801 

.000405  .006420 

13 

50 

2.  Herron  [51 

.0025  .0129 

.000015  .000166 

7 

115 

B.  Approximations  to  standardized  normal  loss  integral 

1.  Byrkett 

.0025  .0241 

.000032  .000581 

11 

30 

2.  Herron  [51 

.0010  .0058 

.000003  .000033 

7 

12 

•Entries  in  table  calculated  by  comparing  approximate  value  with  tabled  value  at  intervals  of  .05 
between  A  =  1 .0  and  A  ”  3  0 
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Two  observalions  may  be  made  from  this  table,  hirst,  the  approximations  of  the  stand¬ 
ardized  normal  loss  integral  arc  closer  to  the  tabled  values  than  those  of  the  complementary 
cumulative  distribution  of  the  unit  normal  according  to  all  criteria.  This  indicates  that  the  lower 
curve  in  Figure  1  is  closer  to  linear  than  the  upper  curve  and  that  approximating  this  curve  may 
produce  a  smaller  error.  Second,  Herron’s  two  line  approximation  is  preferable  to  Schroeder's 
and  Byrkett’s  one  line  approximation,  according  to  all  criteria. 

Though  these  results  tend  to  favor  the  estimates  developed  by  Herron  [5|,  they  are  by  no 
means  conclusive  with  respect  to  their  economic  effects  in  controlling  inventory. 

COMPARISON  OF  OPERATING  POLICIES 

The  major  concern  in  using  these  approximations  is  how  much  influence  they  will  have  on 
the  cost  of  operating  an  inventory  system.  It  is  possible  to  simply  use  an  iterative  algorithm  to 
find  optimal  values  of  Q  and  r  and  to  look-up  values  of  H(r)  and  n(r)  from  normal  tables. 
This,  however,  requires  significantly  more  computer  time  than  the  one  or  two  line  approxima¬ 
tions  discussed  above.  For  example,  the  CPU  time  required  to  execute  the  iterative  algorithms 
and  to  look-up  values  of  H(r)  and  Ti(r)  from  normal  tables  for  all  cases  discussed  below  was 
113.44  seconds.  This  compares  with  2.57  seconds  for  the  one  line  approximations  and  4.42 
seconds  for  the  two  line  approximations.  If  you  consider  an  inventory  system  with  many 
thousand  items  and  frequent  updating,  the  savings  in  computer  time  can  be  substantial. 

With  the  exponential  approximations,  it  is  possible  to  calculate  the  optimal  reorder  points 
and  reorder  quantities  directly  without  any  table  look-ups.  To  determine  how  much  this  com¬ 
putational  advantage  costs,  the  average  annual  cost  of  the  solution  of  Equations  (2)  and  (3), 
Equations  (4)  and  (5),  and  Equations  (4)  and  (6)  are  compared  with  the  annual  cost  of  the 
approximations  given  by  Equations  (9)  and  (!0a,b,c)  and  Equations  (9)  and  (lla.b.c).  Equa¬ 
tion  (1 1,  using  a  table  look-up,  will  be  used  to  compare  the  resulting  operating  policies. 

It  is  not  valid  to  compare  the  approximations  on  a  single  item  from  the  inventory,  since 
all  approximations  may  do  equally  well  on  a  given  item.  Rather,  the  approximations  must  be 
compared  on  the  entire  inventory,  or  on  at  least  a  representative  cross  section  of  the  entire 
inventory.  In  this  study,  cross  sections  of  two  different  inventories  are  used  to  compare  the 
approximations.  One  inventory,  called  the  Maintenance  Inventory,  contains  equipment  and 
parts  for  maintaining  a  large  fleet  of  maintenance  vehicles,  including  cars,  trucks,  graders,  and 
so  forth.  Forty  items  were  selected  from  this  inventory  and  estimates  made  of  X.  C,  /u,  and  <r. 
A  second  inventory,  called  the  Warmdot  Inventory,  contains  spare  parts  for  heating  and  air 
conditioning  equipment.  Brown  (ll  contains  X  and  C  for  sixteen  items  from  this  inventory. 
Estimates  were  made  of  /a  and  <r  by  assuming  a  three  month  lead  time  and  using  the  derived 
relationship  in  Brown  (ll,  < r  =  . 21/a  8  C2. 

In  addition  to  comparing  the  approximations  on  two  different  inventories,  all  three 
methods  of  specifying  shortages  costs  (7 r,  a,  and  fi)  are  considered.  The  shortage  costs,  order¬ 
ing  costs,  and  carrying  costs  are  run  at  three  levels  each  to  determine  the  impact  of  these  costs. 
The  costs  used  are  as  follows: 


Ordering  cost 

(A)  _  SI,  $10,  S100 

Carrying  cost 

(1)  -  .1,  .2,  .3 

Shortage  cost 

—  Low,  Medium,  High 

n  -  $10,  $100,  $1000 

«  -  .15,  .10,  .05 

fi  -.10,01,001. 
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Table  3  summarizes  the  results  of  using  the  single  line  approximation  of  Schroeder  [8) 
and  the  two  line  approximation  of  Herron  [51  to  the  complementary  cumulative  distribution  of 
the  unit  normal.  The  numbers  reported  in  these  tables  are  percentage  increases  in  annual  costs 
for  all  sample  parts  in  the  Maintenance  Inventory  for  the  various  cases.  Similarly,  Table  4 
summarizes  the  results  of  using  the  single  line  approximation  by  Byrkett  and  the  two  line 
approximation  by  Herron  151  to  the  standardized  normal  loss  integral.  Again,  these  results  are 
for  the  Maintenance  Inventory.  Similar  results  are  developed  for  the  Warmdot  Inventory 
though  these  have  not  been  included  in  order  to  conserve  space. 


TABLE  3.  Percentage  Increase  in  Annual  Costs  Us inn  Approximations 
to  the  Complementary  Cumulative  Distribution  of  the  Unit  Normal 
(Ma  miena  nee  / n  ventory  ) 


A 

! 

Approximations* 

Shortage  Cost  Specified 

7 r 

or 

P 

10n 

r  ioo 

1000 

.15 

.10 

.05 

.10 

.01 

.001 

1 

** 

1.0 

4.6 

.6 

.3 

.4 

1.3 

4 

■1 

■ 

2 

** 

1.4 

3.6 

2.0 

3.4 

.0 

1.3 

.4 

.5 

1 

** 

1.1 

3.5 

.6 

.3 

•4 

1.3 

.4 

.8 

■ 

2 

** 

.9 

2.9 

2.3 

2.8 

1.5 

.4 

.5 

1 

1 

** 

1.6 

3.1 

.6 

.3 

.4 

1.3 

.4 

.9 

2 

** 

.5 

2.6 

2.4 

4.1 

.0 

1.6 

.4 

.6 

1 

1 

** 

.4 

.4 

.1 

.3 

1.3 

.3 

2 

** 

.6 

.8 

1.5 

.0 

.4 

.1 

2 

10 

1 

1 

** 

.7 

.5 

.2 

4 

1.2 

.4 

2 

** 

.4 

1.6 

1.8 

.0 

.5 

.1 

.2 

7 

1 

** 

1.3 

1.8 

.4 

.1 

.3 

1.1 

.1 

.7 

2 

** 

.1 

1.2 

1.4 

2.4 

.8 

.4 

| 

1 

** 

.2 

.6 

.2 

.0 

.2 

1.4 

.0 

.0 

2 

** 

.1 

,4 

.2 

.3 

.0 

.1 

.1 

.0 

100 

.2 

1 

** 

.7 

.6 

.3 

.1 

.2 

1.6 

.0 

.1 

2 

.1 

.3 

.3 

.4 

.0 

.2 

.2 

.0 

7 

1 

** 

1.7 

,5 

.3 

.1 

.3 

1.5 

.0 

.2 

2 

** 

.2 

.3 

.4 

.7 

.0 

.2 

.1 

.1 

*1  —  Schroeder  181,  2  —  Herron  |5| 

*'  Indicates  that  the  assumption  <  I  was  violated  for  one  or  more  items  in  the  inventory 

7TK 

It  is  difficult  to  draw  conclusions  from  the  raw  data  provided  in  Tables  3  and  4.  Thus, 
Table  5  is  developed  which  summarizes  the  results  in  Tables  3  and  4  and  the  corresponding 
results  for  the  Warmdot  Inventory  by  averaging  these  percentage  increases  across  all  tables  by 
group.  A  regression  model  was  developed  using  the  percentage  increase  in  annual  costs  as  the 
dependent  variable.  The  independent  variables  were  all  0-1  variables  used  to  represent  the 
seven  groups  listed  in  Table  5  and  all  of  the  two  factor  interactions.  One  by  one  the  indepen¬ 
dent  variables  used  to  represent  the  seven  factors  were  removed  from  the  model  to  test  the 
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statistical  relationship  of  the  given  variable  with  the  percentage  increase  in  cost.  An  /‘test  was 
used  with  significance  level  set  of  99%.  This  will  insure  that  the  family  of  seven  tests  has  a 
joint  significance  level  of  at  least  93%.  Based  on  these  seven  tests,  the  variables  found  to  be 
significant  are  the  inventory  system  under  study,  the  level  of  the  ordering  cost,  and  the  level  of 
the  shortage  cost.  The  approximation  approach  and  the  method  of  approximation  were  nearly 
significant  but  not  at  the  confidence  level  specified. 


TABLE  4.  Percentage  Increase  in  Annual  Costs  Using  Approximations 
to  the  Standardized  Normal  Loss  Integral  (Maintenance  Inventory) 


"I  —  Byrkelt.  2  —  Herron  |5| 

Indicates  that  the  assumption  <  I  was  violated  for  one  or  more  items  in  the  inventory. 
ttK 

Several  observations  may  be  made  from  Table  5.  First,  the  average  percentage  increase 
over  all  groups  studied  is  only  .71%.  This  indicates  that  the  approximations  are  quite  effective. 
However,  maximum  percentage  increase  of  12.9%  indicates  that  under  some  conditions  the 
approximations  are  not  so  effective.  Second,  the  approximations  are  much  more  effective  for 
service  level  type  policies  (a  and  /3)  than  for  shortage  cost  type  policies  <7r ) .  Third,  the 
approximations  are  more  effective  for  inventory  systems  in  which  the  ordering  costs  are  rela¬ 
tively  high  (SI00)  than  for  inventory  systems  with  relatively  low  ordering  cost  SI  and  S 1 0 ) , 
Fourth,  the  approximations  are  more  effective  for  inventory  systems  in  which  the  shortage 
costs  are  relatively  low  than  those  with  high  shortage  costs.  Fifth,  though  the  differences  are 
not  great,  it  appears  that  the  two  line  approximations  to  the  standardized  normal  loss  integral 
produces  the  best  results. 

It  was  noted  in  the  previous  paragraph  that  the  inventory  system  under  study  was  found 
to  be  a  significant  factor.  Though  the  difference  in  mean  percentage  increase  is  not  great,  there 
is  significant  interaction  between  the  inventory  system  and  the  specification  of  shortage  cost. 
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the  level  of  ordering  cost,  and  the  level  of  shortage  cost.  The  interaction  between  the  inven¬ 
tory  system  and  the  method  of  specifying  shortage  cost  is  illustrated  in  Table  6A.  Notice  that 
the  approximations  are  more  effective  for  the  maintenance  inventory  system  when  tt  is 
specified  and  vice  versa  when  a  service  level  is  specified  (n  and  fi). 


TABLK  5.  Percentage  Increase  in  Annual 
C  osts  by  Group 


Croup 

Mean 

Standard 

deviation 

Maximum 

Over  all  groups 

.71 

1.24 

12.9 

Approximation  approach 

H(r) 

.82 

1.51 

10.7 

h(r) 

.59 

1.37 

12.9 

Inventory  system* 

Maintenance  inventory 

.62 

.80 

46 

Warmdot  inventory 

.79 

1.84 

12.9 

Specification  of  shortage  cost* 

TT 

1.55 

2.30 

12.9 

it 

.30 

.57 

4.1 

H 

.41 

.59 

2.3 

Approximation 

Single  line  <  1  <  A'  <  3) 

.81 

1.45 

10.7 

Two  lines  (()  $  A  <  3) 

.60 

1.43 

12.9 

Ordering  cost* 

1 

1.26 

2.11 

12.9 

10 

.63 

1.07 

5.5 

100 

.24 

.41 

2.1 

Carrying  cost 

.1 

66 

1.23 

7.1 

.2 

69 

1.35 

8.4 

.3 

.77 

1.71 

12.9 

Shortage  cost  * 

Low 

.57 

65 

2.4 

Medium 

.50 

.90 

5.3 

High 

1 .03 

2.14 

12.9 

’Indicates  this  variable  is  significant  using  I  test  with  n  =  (II 


II 1 R  I  HI  R  APPROXIMATIONS  TO  CONTINUOUS  HI  ATI  W  INVENTORY  MODI  I 


I  70 


'-v 


Other  selected  interactions  are  also  illustrated  in  Table  6.  Table  6B  indicates  that  the 
approximations  are  least  effective  for  inventory  systems  with  low  ordering  costs  in  which  a 
shortage  cost  is  specified.  Table  6C  indicates  that  the  approximations  are  also  least  effective  for 
inventory  systems  with  a  high  specified  shortage  cost.  Table  6D  combines  the  results  of  Tables 
and  6B  and  6C  and  indicates  that  the  approximations  are  least  effective  for  approximations  with 
a  combination  of  a  low  ordering  cost  and  a  high  shortage  cost. 

TABLE  6.  Percentage  Increase  in  Annual  Costs 
for  Selected  Two  Factor  Interactions 

A.  Specification  of  shortage  cost  versus  inventory  system 


Inventory 

system 

Specifications  of  shortage  cost 

i t  a  (i 

Maintenance 

Warmdot 

1.02  .48  .49 

1.90  .11  .34 

B.  Specification  of  shortage  cost  versus  ordering  cost 


Ordering 

cost 

Specification  of  shortage  cost 

7T 

a 

a 

1 

3.01 

.51 

.54 

10 

1.31 

.27 

.41 

100 

.33 

.11 

.28 

C.  Specification  of  shortage  cost  versus  shortage  cost 


Shortage 

cost 

Specification  of  shortage  cost 

7 T 

a 

/* 

Low 

.16 

.41 

.92 

Medium 

.93 

.42 

.15 

High 

2.86 

.07 

.17 

D.  Ordering  cost  versus  shortage  cost 


Shortage 

cost 

Ordering  cost 

1 

10 

100 

Low 

.78 

.56 

.36 

Medium 

.98 

.34 

.18 

High 

1.93 

98 

.19 
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Hadley  and  Whitin's  [4|  approximate  continuous  review  inventory  model  has  received  fre¬ 
quent  analysis  in  the  literature  (recently,  3  and  6|,  though  little  has  been  reported  of  actual  use. 
Perhaps  the  reason  for  this  apparent  lack  of  use  is  the  requirement  that  a  probability  distribu¬ 
tion  be  specified  for  lead  time  demand,  and  the  requirement  that  a  backordering  cost  or  service 
level  be  specified  Moreover,  even  if  one  is  willing  to  specify  the  normal  distribution  for  lead 

lime  demand  and  an  appropriate  backordering  cost  or  service  level,  one  still  may  be  hesitant 

about  using  the  iterative  solution  algorithm  that  requires  the  use  of  normal  tables.  The  purpose 
of  this  paper  is  to  evaluate  some  approximations  that  relieve  the  latter  two  deterrents  to  using 
this  model. 

This  evaluation  produced  the  following  results. 

1  The  exponential  approximations  result  in  operating  policies  very  near  those  of  iterative 
algorithms.  The  average  increase  in  annual  costs  as  a  result  of  using  these  approximations 
is  .71 depending  on  certain  characteristics  of  the  inventory  system. 

2  It  is  preferable  to  approximate  the  standardized  normal  loss  integral  with  an  exponential 
function  than  to  approximate  the  complementary  cumulative  distribution  function. 

3.  The  approximations  are  closer  for  a  or  fi  specified  policies,  than  for  n  specified  policies. 

4  A  single  line  approximation  in  the  range  A  =  1.0  to  A  =  3.0  is  nearly  as  effective  as  a 

two  line  approximation. 

3  The  approximations  are  least  effective  for  inventory  systems  with  low  ordering  costs  and 
high  specified  shortage  costs. 


REFERENCES 

III  Brown.  R.G..  "Estimating  Aggregate  Inventory  Standards."  Naval  Research  Logistics  Quar¬ 
terly  H).  55-71  (1963) 

1 2 1  Brown.  R  (i  .  Deeision  Rules  for  Inventory  Management.  (Holt,  Rinehart,  and  Winston,  New 
York.  N.Y..  (1967). 

13)  Ciross.  I)  and  J E.  Ince.  "A  Comparison  and  Evaluation  of  vApproximate  Continuous 

Review  Inventorv  Models."  International  Journal  of  Production  Research,  /3,  9-23 
(1975) 

1 4]  Hadley,  (i  and  T  M  Whitin.  -Utalysis  of  Inventory  Systems .  (Prentice-f fall.  Englewood 

Cliffs.  New  Jersey  1963) 

1 5 1  Herron.  DP.  "Inventorv  Management  for  Minimum  Cost."  Management  Science  14. 
B2I9-B235  (1967) 

|6|  Nahmias.  S  .  "On  the  Equivalence  of  Three  Approximate  Continuous  Review  Inventory 
Models."  Naval  Research  Logistics  Quarterly,  33.  31-36  (1976). 

|7|  Parker.  I  T.  .  "Economical  Reorder  Quantities  and  Reorder  Points  With  Oncertain 
Demand,"  Naval  Research  Logistics  Quarterly,  II.  351-358  (1964). 

181  Schroeder,  R  (i  .  "Managerial  Inventory  formulations  With  Stockout  Objective  and  Fiscal 
Constraints.”  Naval  Research  Logistics  Quarterly.  3/,  375-388  (1974) 


A  NOTE  ON  THE  MIXTURE  OE  NEW  WORSE 
THAN  USED  IN  EXPECTATION 


Kishan  (i  Mchrotra 

Syracuse  University. 
Syracuse.  .Vctt  York 


I.  INTRODUCTION 


The  class  of  distributions  which  are  new  worse  than  used  in  expectation  (NWUE)  was  first 
introduced  by  Marshall  and  Proschan  [2|.  These  classes  play  an  important  role  in  the  theory  of 
reliability  and  in  particular  arise  quite  naturally  in  considering  replacement  policies.  A  nonnega¬ 
tive  distribution  /'with  survival  function  /  and  expected  value /a  is  said  to  be  NWUE  if 


M 


A  (.v)  dx 


for  all  />(). 


In  this  note  we  are  interested  in  the  following  question:  Is  the  class  of  NWUE  preserved  under 
arbitrary  mixture?  Barlow  and  Proschan  |!i  conjectured  that  NWUE  is  not  preserved  under 
arbitrary  mixtures.  In  section  I  of  this  note  we  present  examples  which  verify  this  conjecture 
and  in  Section  2  we  give  some  other  elementary  properties  of  distribution  of  this  class. 

2.  NWUE  IS  NOT  PRESERVED 

The  examples  considered  below  are  obtained  in  view  of  Lemmas  1  and  2  of  the  next  sec¬ 
tion.  That  is,  we  take  two  specific  NWUE  distributions  F\  and  /\  with  respective  expectations 
M  i  and  M2  such  that  mi_>  M2  _and  /j  crosses  F:  from  above.  Then,  at  a  point  t  beyond  the 
point  of  intersection  of  /j  and  /j  the  defining  equation  of  NWUE  is  not  satisfied. 

Consider  the  mixture 

Fix)  =  y  i/jO  >  +  Fsjx))  lor  all  a  >  0 

where 

/j  (V)  =  r  './iiKi  f()r  v  ^  0 

and 

/•,(.v)  =  e  u  for  ik  -  1)8  ^  .v  <  Aft.  A  =  1,  2 . 


I  hus.  /j  is  the  exponential  distribution  function  with  expected  value  mi  =  -8  and  clearly 
NW'I'E  /  ,fv)  is  a  slight  modification  of  distributions  considered  by  Barlow  and  Proschan  III 
m  Section  S  9  of  Chapter  6.  Since  FAx)  is  easily  seen  to  be  a  NWU,  by  (2  4)  of  Chapter  6  of 
Barlow  and  Proschan  III,  it  is  clearly  NWUE.  The  expected  value  of  the  random  variable  with 
liMribution  function  f  \  is 
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and 

J,  UHlS 

7\(.v)  dx  =  (kd-t)  e  **+— - — 

'  -  1  -  e  * 

where  k  is  an  integer  such  that  (A  —  1)8  <  t  <  AS.  For  a  given  S  set 
HS.t)  =  ~  (aii  +M2<S))~  <7j  U)  +  72</)) 

and 

R  (8./ )  =  f  j/',(-v)  +  /2(.v))  <7v. 

2  J ' 

Then,  for  ft  =  .5.  /ad. 5)  =  .77074.  For  t  =  .5-«,  where  «  is  positive  and  very  small,  for 
instance,  «  =  .001.  LL5.  .5  —  € )  =  .44836  and  R  (.5.  .5  —  « )  =  .44784.  Clearly,  H.5.  .5  — e)  > 

R(.5.  .?  —  «).  Thus,  the  mixture  {-j  7t(.v)  +  F2(x)\  is  not  NWUE.  This  inequality  holds  for 

values  of  t  slightly  less  than  1  and  2. 

The  above  examples  clearly  show  that  NWUE  is  not  preserved  under  the  mixture,  as  con¬ 
jectured  by  Barlow  and  Proschan  [11. 

3.  SOME  ADDITIONAL  PROPERTIES  OF  NWUE  DISTRIBUTION 

LEMMA  1:  Let  7  be  the  class  of  NWUE  distributions  with  equal  mean  /x.  Then  any 
arbitrary  mixture  of  7„  t  7  is  NWUE. 

PROOF:  Let  F  =  J F„  dG(a)  for  arbitrary  distribution  function  G. 

Then. 

ix,  =  /  F(x)dx  =  fj°  f  F„  (x)dG(<x)dx  =  f  lfj°  F„  (x)dxldG(a) 


=  J  p.  dG  (a )  =  n 


where  the  second  inequality  holds  by  Fubini’s  Theorem.  Next,  for  arbitrary  r  >  0, 

X  H.vidv = X  iX  F„  (x)dG («)  1  dx  =  jij;  F  (x)dx]dG  (or) 

^  J  pF„  (l)  dG  (a )  =  /uC(/). 


Thus,  7  is  NWUE. 

LEMMA  2:  Let  7j  and  72  be  two  NWUE  distribution  functions  such  that  F \  crosses  F2 
once  from  below.  Let  /a,  >  ixj  where  /a,  is  the  mean  associated  with  F,.  Then  for  any 
p.  0  ^  p  <  1.7'Lv)  =  pF |(.v)  +  qF2(x)  is  NWUE:  q  =  1  —  p. 


NOTION  NWtll.  DISTRIBUTION  M  INC  I  IONS 


PROOF: 

I  J  Fix)tlx  —  p.  Fit  >!  =  (p  +  q  )  J  \p) '  i(  v)  +  qi  , ( a  )|</v 
—  ip /X  |  +  qpdipT  \  it)  +  qf  i(.v  ) ) 

=  /r  [J*  7\(x)dx —  /u. ,  A'|  ( / )  1  +  </2|  J"(  Pilxidx 

+  /*/[/  {  /■',  <-V  )  +  /M.V  ))</V  —  /X  |  At  (  /  )  —  ^li/|(0l. 

To  show  that  this  expression  is  positive  for  all  /,  it  is  sufficient  to  show  that  the  third  term  is 
positive,  because  the  first  two  terms  are  positive  by  assumption  l\  is  NWUE,/  —  1.2.. 

Let  in  be  the  point  where  i\  crosses  h\  from  below.  Then  for  t  >  uhF\(t)  >  /■'■>< r ) . 

Thus, 

/  \T\ix)  +  FAx)\dx -pJ2(t)-p2Ft(t)^  (pi  | -p.  2 )  ( 7 (/)  —  /-■,(/)  )^0. 


For  t  <  r„. 


J  I  F|  (a  )  +  t  i(.v  )]dx  =  {ju. i /•  2 ( r )  +  m (r ))  ^  i  —  M e )  J* 

,  e'  Td.v)  T,(.v) 

=  (M|  ~M>)  J - 

•'ll  u,  Ml 


/•;(.v)  /  .(V) 

— 1 - - —  dx 

Ml  M2 


,  .  F2(x)  F\(x)  F\(x) 

But  for  t  <  ^  ^  - -  .  Therefore,  the  above  integral  is  positive. 

M2  M  2  Ml 

The  following  result  provides  a  lower  bound  for  the  distribution  function  for  any  member 
of  NWUE  class  in  terms  of  its  expectation. 

LEMMA  3:  If  F  is  a  NWUE  and  ju.  is  its  expectation,  then  Fit)  Js  — ^ —  for  all  /^0. 

I  +  m 

PROOF:  fxF(t)  <  J”  F(x)dx  =  \x-  Fix)dx<  m  -  if'lt)  because  Fix)  >  Fit ) 

for  all  t.  Thus,  Fit) ^  ^ —  or  Fit)  ^  — - — ,  t  >  0. 

M  +  I  m  +  t 

To  show  that  the  above  bound  is  sharp  we  consider  the  following  example.  Let  V  be  a 
nonnegative  random  variable  such  that 

F  IT  =  0|  =  « 

P  10  <  X  <  a  I  =  0 


and  for  X  >  a,  the  density  is  given  by 


fix)  =  ( 1  —  a )  —  (' 
K 


( x  a  h’A 


1X4 
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where  X  =  a  («  -  ( l  -<• )-’)  /  <  1  -<* )’,  and  0<«<  1  is  chosen  so  that  «  -  (1  -  «)2  >0. 

The  expected  value.  /a„,  of  the  random  variable  X  is  given  by 
fi =  ( 1  -  a  )  |a  +  X  | 

and  the  distribution  function  F  is  such  that 

( 1  -  a  )  for  0  ^  x  <  a 
F',(x)  =  (I  -«)  e  ('  k  for  x  >  a. 

Clearly,  F  is  NWUE.  Moreover,  for  x  =  a 

— ^ —  -  — - —  =  (I -a)  -  FJa). 
x  +P„  a  +na 

implving  that  for  any  give  m  >0,  and  for  each  a  >  0,  there  exists  a  NWUE  Fa  such  that 
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ABSTRACT 

l  et  /,  and  1 2  map  |().  /  |  into  the  real  numbers  A  system  is  following  ei¬ 
ther  1 1  or  1 2  and  earning  the  associated  reward  f /j  or  f  respectively.  It  is 
possible  at  any  time  to  switch  from  /,  to  /,  by  paying  a  switching  cost  h  >  I) 
We  determine  a  switching  policy  which  maximizes  the  total  reward  Conditions 
which  guarantee  .1  planning  hon/on  are  established 


INTRODUCTION 

In  many  endeavors  one  must  choose  one  of  two  activities,  each  of  which  has  a  time- 
varying  reward.  There  is  usually  a  cost  associated  with  switching  from  one  activity  to  the  other. 
Such  is  the  case  in  fisheries,  where  a  fisherman  chooses  each  day  to  fit  his  boat  for  deep  or 
shallow  fishing,  and  this  paper  stems  from  a  model  of  such  behavior.  We  model  this  situation 
in  the  following  way.  Let  J\  and  f  :  map  |0,  T]  into  the  real  numbers.  A  system  is  following 
either  J\  or  f2  and  earning  the  associated  reward  f  t)  or  f  f2,  respectively.  It  is  possible  at 
any  time  to  switch  from  /,  to  ./,  by  paying  a  switching  cost  b  >  0.  For  example,  if  the  system 
begins  following  J\.  switches  at  time  f  ^  0  to  /:.  and  then  switches  back  to  J\  at  time  i2  ^  / 1, 
the  total  reward  is 

/,  /•  +/ + /,'/<-  2/>- 

The  problem  of  optimal  switching  between  two  activities  has  been  studied  by  Pekelman 
[2|.  who  required  switching  to  occur  in  a  continuous  fashion  with  a  bounded  rate.  In  our  case, 
switching  occurs  instantaneously.  Pekelman  derived  the  nature  of  an  optimal  poliey  using 
Lagrange  multipliers,  and  showed  the  existence  of  planning  and  forecast  horizons.  Our  prob¬ 
lem  is  simpler,  and  our  analysis  relies  on  dynamic  programming.  We  also  characterize  planning 
horizons. 

REDUCTION  AND  ASSUMPTIONS 

A  function  ./  is  said  to  change  sign  at  /  if. /  lakes  both  negative  and  positive  values  in  any 
neighborhood  of  I.  We  assume 

(Al)The  set  of  points  in  (0,71  where  f\  —  f2  changes  sign  is  nonempty  and  finite.  Let 

0  <  f(  <  /j  <...</„  <  The  an  enumeration  of  this  set. 
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(A2)  There  is  no  interval  in  [0.71  on  which  J\  —  /j  is  identically  zero.  A  model  which  does 
not  satisfy  this  assumption  can  be  reduced  to  one  which  does. 


We  note  that  the  performance  of  any  policy  is  dominated  by  the  performance  of  a  policy 
which  switches  only  on  the  set  jO.  t,.  t2  . ...  /„).  If  a  policy  mandates  a  switch  at 
s  €  ( z* .  4  +  i),  then  the  switch  can  be  relocated  from  s  to  tk.  tk  +  t,  or  to  coincide  with  some 
other  switch  in  (ik.  tkn),  so  that  the  reward  is  not  decreased.  If  two  switches  coincide,  they 
can  both  be  eliminated  with  no  loss  of  reward.  It  is  clear  then  that  the  switch  at  s  «  m  be 
moved  to  tk.  z*  + , ,  or  eliminated  altogether. 

This  observation  restricts  our  attention  to  policies  which  switch  only  at  (0,/! . /„}. 

Since  there  are  only  finitely  many  such  policies,  an  optimal  policy  exists.  Let  6,  =  0  and  define 

r  U  ■  i  , 

A  policy  which  mandates  following  f2  on  the  intervals  I/A ,  ik  1,  0  ^  <  tk  ^ 

t„  earns  reward 

“a,  +  /i  ~  ch’ 

where  c  is  the  number  of  switches  incurred  by  the  policy.  We  have  thus  reduced  our  problem 
to  the  following  sequential  optimization  model. 


m 

I 


M:  At  each  stage  k ,  a  system  is  in  either  stale  0  or  state  1.  A  policy  n  =  (  . /O  is 

a  sequence  such  that  each  nk  maps  {0. 1}  into  (Hold,  Switch)  or  simply  (H,S).  If  the  A:  -th 
state  is.v*,  then  the  A  -th  control  is  uk  =  /x*  (.v*),  the  ( k  +  1 )  -sz  state  is 


(1) 


-Via  +  ii  =  ,/<-v*.ma) 


-f*  if  »a  =  H, 

I  -  xk  if  uk  =  S, 


and  the  reward  associated  with  (xk,uk)  is 


(2) 


Klxk,uk) 


0 

-b  +  ak 
-b 


if  xk  =  0,  uk  =  If, 

if  xk  =  0,  uk  =  S, 

if  xk  =  1,  uk  =  //, 

lf  xk  =  1 .  uk  =  S. 


We  wish  to  find  a  policy  7r  which  maximizes 


(3) 


JJxJ  = 


L  M(xk,uk). 
A-0 


This  is  a  finite  stage,  deterministic,  dynamic  programming  model  with  two  states  and  two 
actions.  The  dynamic  programming  algorithm  for  this  model  is  simple  and  computationally 
feasible.  This  model  has,  however,  a  special  feature  which  leads  to  a  more  efficient  algorithm. 
It  is  apparent  that  whenever  0  ^  ak  ^  2b  (—2 b  ^  at  ^  0),  there  is  nothing  to  be  gained  by 
switching  from  0  to  1  (I  to  0)  at  stage  k  and  back  to  0(1)  at  stage  k  +  1.  To  build  on  this 
observation,  we  define  a  model  more  general  than  M. 

DEFINITION.  We  say  a  dynamic  programming  model  N  is  alternating  if  it  has  two  states 
0  and  1,  two  actions  //  and  ,V,  system  equation  U),  one-stage  reward  (2)  and  objective  func¬ 
tional  (3).  We  require  that  b  ^  0  and  .-fv  =  («»,«! . «„)  is  an  ordered  set  of  real 

numbers  such  that  a„  t*  0.  and  the  nonzero  members  of  the  set  have  alternating  signs.  A  pol¬ 
icy  for  an  alternating  model  is  a  sequence  v  =  i . M  J  such  that  fxk  maps  (0, 1)  into 
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I //..S')  We  say  n  is  feasible  if  a*  =  0  implies  p.  *  Lx* )  =  //,  regardless  of  the  choice  of  .v„.  We 
say  it  is  optimal  if  it  maximizes  J„(x„)  over  all  feasible  policies  (independent  of  x0),  and  the 
reward  7„(x, >)  corresponding  to  an  optimal  it  is  called  the  value  function. 

The  model  M  is  an  alternating  model  with  every  a*  different  from  zero.  Given  an  alter¬ 
nating  model  V,  we  can  construct  a  related  alternating  model  <b  (AT  by  the  following  procedure: 

(P)  Let  m  ^  n  be  the  largest  integer  for  which  nm  ^  0.  If  <«m  and  an  are  the  only  nonzero 

members  of  -l\,  set  («*>=«*  A  =  0 . n.  Otherwise,  determine  the  index 

A,  I  <  A  <  m  -  1,  of  the  smallest  nonzero  la*|.  If  more  than  one  such  la* I  exists, 

choose  the  smallest  index.  If  la*  I  >  2b,  set  cfi(a*)  =  a*,  A  =  0 . n.  If  la* I  ^  2b, 

let  p  =  max{/)|0  <  p  <  A  -  I.  a,,  ;*()),</  =  minimi  A  +  I  ^  q  ^  n,  aq  ^  0),  and  set 

(f)  Ut  k  )  -  ,  A  ^  A  A,  A  If, 

d>  (tt  .1  *  +  (\ !  ,  +  «  . , 

/  phi y 

<f>  (a.)  ~  (f>  (<*  )  =  0. 

*  ‘/ 

The  model  </>(  \  )  is  the  model  V  with  each  a*  replaced  by  </>(a*).  It  is  easily  verified  that 
cM.V)  is  alternating. 

Hither  the  models  </>(  V)  and  V  are  the  same,  or  else  f(N)  is  simpler  than  A’  is  the  sense 
that  contains  more  zeroes  than  4S.  l  or  example,  if  ,-K  =  (- 1 . 3. 0, 0,-2, 5)  and  />==!. 

then  =  (-1.6, 0.0. 0.0). 

LEMMA:  Let  A'  be  an  alternating  model  and  let  <MA)  be  derived  from  \  by  procedure 
IP).  Then  every  optimal  policy  in  <f>  < A )  is  also  optimal  in  N. 

PROOF:  Since  every  feasible  policy  in  </>  ( /V )  is  feasible  in  N  and  leads  to  the  same 
reward  in  both  models,  it  suffices  to  show  that  both  models  have  the  same  value  function.  We 
will  show  this  by  producing  a  policy  which  is  optimal  in  A  and  feasible  in  <b  (A).  There  is  noth¬ 
ing  to  prove  when  fiN)  =  N,  so  we  assume  the  contrary,  i.e.,  la*  I  <  2b. 

The  dynamic  programming  algorithm  for  /V  takes  the  following  form.  For 
k  =  0,  1 . ii,  if  a*  ^  0, 

(4)  7*  (0)  =  max  |7*  +  i  (0),  —b  +  a*  +  7*  +  )  ( 1)} 

(5)  7*(1)  =  max  [a*  +  7*  +  ((l),  - b  +  7*  + 1 ( 0 ) ) , 
while  if  a*  =  0, 

(6)  7*10)  =  7*  +  |(0), 

(7)  •/*<!>  =  7*  +  1(l). 


where  7„+1(0)  =  7„+|(l) 

=  0. 

Define  7r  =  (/u.n, p\  ,  . 

•  -  •  by 

(8)  Ma<0)  = 

\h 

if 

7*(0)  =  7*  +  l(0), 

\S 

if 

7*  (0)  >  7* , ,  (0), 

(9)  M*<1)  = 

\h 

Is 

if 

if 

7*  (1 )  =  a*  +  7*  +  ; 
7*(1)  >  a*  +  7*  +  | 

(1). 

(1). 

The  policy  n-  is  optimal  for  (AT  (I,  p.  50).  We  show  it  is  feasible  for  </>  (A7),  i.e.,  for  any  initial 
slate  x„. 


i 


ISS 


<-'A )  =  H. 
<\>  =  H. 
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(10) 

(11) 


Observe  first  that  (4 ) - ( 7 )  imply 


(12)  Jkm  <  b  +  Jk(\).  A  =  0.  1 . n  +  I. 

(13)  yAm  <  b  +  yA«)),  a  =  o.  i . n  +  l. 


Recall  that  «A  ^  0  and  |aj  ^  2b. 
simplicity  of  notation  that  p  —  k- 


Since  Jrn  =  Jk  and  Jk  +  X  =  JQ,  we  can  and  do  assume  for 
I,  q  =  A  +  1.  Thus,  we  have  _ , |  >  |aA|,  |aA  +  1|  ^ 


We  assume  ak  <  0.  The  other  case  is  treated  similarly.  We  have 
i  >  °.  «oi  >  °-  From  (12)  we  have 
—  h  +  Jk  ,,(0)  ^  /i+^ID  <  “a  +  i  + 
so  (5)  and  <9)  imply 

(14)  yA.,(l)  =  «Atl  +  yA+2<n.  nk  +  x  i\)-H. 

Since  «A  <  0,  (13)  implies 

-h  +  ,tk  +  yA.,(l )  <  -b  +  4^,(1)  <  yA  +  ,<0). 

so  from  (4)  and  (8)  we  have 

(15)  yA(0)  =  yA> ,(()).  M<(0  )=//. 

Since  l«Al  ^  we  have  from  (12). 

-t>  +  A.2<0)  <  Jkt!i I)  <  «A  +«A+I  +  yA+2(i). 

Since  l«A  I  <  2b,  we  also  have 

-2b  +  akt]  +  yA+2(l)  <  «A  +  «AH  -I-  yA+2o). 

Together  with  (4)  and  (14),  these  inequalities  yield 

-b  +  yA+,(o)  <  ttA  +- «A+I  +  yA+2o)  =  «A  +  yA+l(i). 

From  (5)  and  (9)  we  now  have 

06)  yAd)  =  «A +yA+1(i),  MAd>  =  h. 

Equations  (15)  and  (16)  imply  (10).  It  remains  to  establish  (11). 

Since  aAi,  >  0. (5),  (9)  and  (12)  imply  nk  +  ](\)=  H.  If  Jk+2{ 0)  ^  -b+  aA+,+  yA+2(l), 
then  (4)  and  (8)  imply 

4q'0>-  A.2(0).  ma  +  1(0 >=//. 

and  (II)  follows.  On  the  other  hand,  if  yA,2(0)  <  —b  +  «A  +  ]  +  yA+2(l),  then 
(17)  yA  +  i<0)  -  -b  +  «*,,  +  yA<2(l).  M*+1(0)-S. 

and  (ID  will  hold  if  and  only  if  ,vAtl=  I  (independent  of  the  choice  of  x0).  Since 
«A  ,  +  «A  >  0,  (15),  (17),  (14)  and  (16)  imply 
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(())  =  —  />  +  4 

i  +  A  * :  ^  * 

<  -b  +  ak 

1  +  “A  +  “a 

=  -/)  +  ak 

1  +  "A  +  Jk  . 

=  ~b  +  ak 

i  +  y*<D- 

From  <41  and  (8)  we  see  lhal 

(18)  Jk  , (0)  =  ~b  +  <*k  |  +  ^<1).  /a k  | (0)  =  .S'. 

Since  <*k  ,  >  0,  (12)  implies 

-b  +  Jk  (0)  ^  Jk(\)  ^  ak  ,  +  Jk(l). 
and  (5)  and  (9)  yield 

(19)  Jk  ,(1)  =  ak  ,  +  Jk(  1).  m*  ,<!)  =  //- 

Equations  (1),  (18)  and  (19)  imply  ,vA  =  1.  Equations  (1)  and  (16)  imply  ,\k  +  ,  =  I,  as  was  to 
be  proved.  Q.E.D. 

We  slate  now  a  theorem  which  gives  a  simple  construction  of  an  optimal  policy  for  an 
alternating  model  N  for  which  <t>(N)  =  N.  We  show  also  that  any  alternating  model  can  be 
reduced  to  this  case. 


THEOREM:  Let  N  be  alternating  model  for  which  <MA0  =  (V.  If  ,4V  has  only  two 

nonzero  members  ttn  and  then  an  optimal  policy  n  =  (/no.Mi . for  (AO  is  given 

by 


(20) 

(21) 


(22) 


fj.k(xk)  =  //  ,  k  &  0  ,  k  ^  m  , 


M  W  ^  -'  w  1 


S  is  xm  =  0  ,  a,„  >  b 

or  1  .  (x /rl  6, 
H  otherwise. 


5  if  Xu  =  0  .  —  b  +  an  +  Jm  ( 1 )  >  y„,  (0) , 
or  xn  =  1  .  —  b  -k  Jm( 0)  >  «d  4-  y„,(l), 
//  otherwise. 


where 


y„,  (0)  =  max|0,  -h  +<«,„!, 
7„,  ( 1 )  -  max  —b  |. 


If  4\  has  more  than  two  nonzero  members,  then  the  policy  defined  by 


(23) 


Ma  (aa  ) 


S  if  a*  =  0  .  <*i  >  b. 

or  xk  =  1  .  a*  <  -h. 
H  otherwise. 


is  optimal  for  /V.  if  <f>(N)  ^  /V,  then  there  exists  some  positive  integer  i  such  that 
<b‘ + 1  (/V)  =  <2>'(/V),  and  any  optimal  policy  for  «/>'(  Af )  is  also  optimal  for  N. 


i 
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PROOF:  For  the  trivial  case  where  -ls  has  only  two  nonzero  members,  the  optimality  of 
the  policy  given  by  (20M22)  follows  directly  from  the  dynamic  programming  algorithm  14)- 
(9).  Suppose  now  4N  has  three  or  more  nonzero  members  and  m  is  the  largest  index  with 
^  0.  Since  <£(.Y)  =  <V,  0  implies  la*  I  >  2b  for  1  $  A  ^  m  -  1.  The  optimality  of 

(23)  follows  from  (4)-(9),  (12)  and  (13).  Finally,  if  <b(N)  r*5  N,  then  contains  fewer 
nonzero  elements  than  .l\.  After  finitely  many  iterations  of  <6,  we  must  obtain  <6 ' ( .V )  such  that 
ch'  +  1(,V)  =  cb'LV).  0  E  D 

EXISTENCE  OF  PLANNING  HORIZONS. 

Suppose  in  an  alternating  model  .V  we  have  «*  >  2b  for  some  A.  Then,  in  the  notation  of 
procedure  IP),  either  <b(«*)  =  «*,  <£(«*)  =  a*  +  ak  +u(  >  a*.  or<Ma*)  =  0.  The  last 
case  occurs  if  A  =  q,  in  which  case  p  <  A,  <t>(ap)  =  ap+ak  +  a*  >  a*,  and  <t>iatp+l) 
=  ...  =  </> (a* )  =  0.  For  any  /,  we  will  have  either  <A'(<**)  ^  u*  >  2b,  or  else  (/>'(«,)  >  2b, 
where  /  <  A  and  <b'(a/+i)  =  . . .  =  <t>'(ak)  =  0.  If  <b'+l(/V)  =  <t>'(N),  then  the  optimal  policy  of 
the  Theorem  guarantees  that  xk  +  \  =  1.  Thus,  we  can  disregard  at  for  j >  k  +  1  in  determin¬ 
ing  an  optimal  policy  for  stages  0  through  A.  If  a*  <  -2b,  a  similar  argument  holds,  where 
now  we  have  .v*  + 1  =  0. 

In  conclusion,  if  |«J  >  2b,  or  for  any  /  >  1,  |(b'(a*)|  >  2b,  then  we  can  solve  the 
smaller  problem  of  optimal  switching  between  any  stage  /;  <  A  and  stage  A  independent  of  the 
values  of  <*,,  where  /  does  not  satisfy  It  <  j  <  A,  and  the  policy  thereby  obtained  will  be  part 
of  an  optimal  policy  for  the  full  problem. 
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